iDBP-PBMD: A machine learning model for detection of DNA-binding proteins by extending compression techniques into evolutionary profile

Chemometrics and Intelligent Laboratory Systems - Tập 231 - Trang 104697 - 2022
Ameen Banjar1, Farman Ali2, Omar Alghushairy1, Ali Daud3
1Department of Information Systems and Technology, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
2Department of Elementary and Secondary Education, Peshawar, Khyber Pakhtunkhwa, Pakistan
3Abu Dhabi School of Management, Abu Dhabi, United Arab Emirates

Tài liệu tham khảo

Ali, 2022 Luscombe, 2000, An overview of the structures of protein-DNA complexes, Genome Biol., 1, 10.1186/gb-2000-1-1-reviews001 Ali, 2020, SDBP-Pred: prediction of single-stranded and double-stranded DNA-binding proteins by extending consensus sequence and K-segmentation strategies into PSSM, Anal. Biochem., 589, 10.1016/j.ab.2019.113494 Sandman, 1998, Diversity of prokaryotic chromosomal proteins and the origin of the nucleosome, Cellular and Molecular Life Sciences CMLS, 54, 1350, 10.1007/s000180050259 Ali, 2022, Deep-PCL: a deep learning model for prediction of cancerlectins and non cancerlectins using optimized integrated features, Chemometr. Intell. Lab. Syst., 221 Ali, 2021, AFP-CMBPred: computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information, Comput. Biol. Med., 139, 10.1016/j.compbiomed.2021.105006 Al-Lazikani, 2006, How many drug targets are there, Nat. Rev. Drug Discov., 5, 993, 10.1038/nrd2199 Gronemeyer, 2004, Principles for modulation of the nuclear receptor superfamily, Nat. Rev. Drug Discov., 3, 950, 10.1038/nrd1551 Hudson, 2018, Cryptic glucocorticoid receptor-binding sites pervade genomic NF-κB response elements, Nat. Commun., 9, 1337, 10.1038/s41467-018-03780-1 Tran, 2018, Nuclear receptors and liver disease: summary of the 2017 basic research symposium, Hepatology communications, 2, 765, 10.1002/hep4.1203 Ahmad, 2020 Dong, 2015, Identification of DNA-binding proteins by auto-cross covariance transformation, 470 Wei, 2017, Local-DPP: an improved DNA-binding protein prediction method by exploring local evolutionary information, Inf. Sci., 384, 135, 10.1016/j.ins.2016.06.026 Ali, 2018, DBPPred-PDSD: machine learning approach for prediction of DNA-binding proteins using Discrete Wavelet Transform and optimized integrated features space, Chemometr. Intell. Lab. Syst., 182, 21, 10.1016/j.chemolab.2018.08.013 Ali, 2019, DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information, J. Comput. Aided Mol. Des., 33, 645, 10.1007/s10822-019-00207-x Ali, 2022, Target-DBPPred: an intelligent model for prediction of DNA-binding proteins using discrete wavelet transform based compression and light eXtreme gradient boosting, Comput. Biol. Med., 145, 10.1016/j.compbiomed.2022.105533 Rahman, 2018, DPP-PseAAC: a DNA-binding protein prediction model using Chou's general PseAAC, J. Theor. Biol., 452, 22, 10.1016/j.jtbi.2018.05.006 Mishra, 2018, StackDPPred: a stacking based prediction of DNA-binding protein from sequence, Bioinformatics, 35, 433, 10.1093/bioinformatics/bty653 Hu, 2019 Ahmad, 2021, Deep-AntiFP: prediction of antifungal peptides using distanct multi-informative features incorporating with deep neural networks, Chemometr. Intell. Lab. Syst., 208, 10.1016/j.chemolab.2020.104214 Ahmad, 2022 Akbar, 2021, iAtbP-hyb-EnC: prediction of antitubercular peptides via heterogeneous feature representation and genetic algorithm based ensemble learning model, Comput. Biol. Med., 10.1016/j.compbiomed.2021.104778 Akbar, 2020, iHBP-DeepPSSM: identifying hormone binding proteins using PsePSSM based evolutionary features and deep learning approach, Chemometr. Intell. Lab. Syst., 204, 10.1016/j.chemolab.2020.104103 Li, 2003, Reduction of protein sequence complexity by residue grouping, Protein Eng., 16, 323, 10.1093/protein/gzg044 Ali, 2015, Classification of membrane protein types using voting feature interval in combination with chou׳ s pseudo amino acid composition, J. Theor. Biol., 384, 78, 10.1016/j.jtbi.2015.07.034 Yu, 2017, Prediction subcellular localization of Gram-negative bacterial proteins by support vector machine using wavelet denoising and Chou's pseudo amino acid composition, Chemometr. Intell. Lab. Syst., 167, 102, 10.1016/j.chemolab.2017.05.009 Wei, 2016, Exploring local discriminative information from evolutionary profiles for cytokine–receptor interaction prediction, Neurocomputing, 217, 37, 10.1016/j.neucom.2016.02.078 Dehzangi, 2013, Exploring potential discriminatory information embedded in pssm to enhance protein structural class prediction accuracy, 208 Ahmed, 1974, Discrete cosine transform, IEEE Trans. Comput., 100, 90, 10.1109/T-C.1974.223784 Barukab, 2022, DBP-CNN: deep learning-based prediction of DNA-binding proteins by coupling discrete cosine transform with two-dimensional convolutional neural network, Expert Syst. Appl., 197, 10.1016/j.eswa.2022.116729 Rao, 1990 Khan, 2019, iPredCNC: computational prediction model for cancerlectins and non-cancerlectins using novel cascade features subset selection, Chemometr. Intell. Lab. Syst., 195, 10.1016/j.chemolab.2019.103876 Shen, 2017, Identification of DNA–protein binding sites through multi-scale local average blocks on sequence information, Molecules, 22, 2079, 10.3390/molecules22122079 Khan, 2021, piEnPred: a bi-layered discriminative model for enhancers and their subtypes via novel cascade multi-level subset feature selection algorithm, Front. Comput. Sci., 15, 1, 10.1007/s11704-020-9504-3 Saini, 2016, Protein fold recognition using genetic algorithm optimized voting scheme and profile bigram, J. SW, 11, 756 Khan, 2019, iRSpot-SPI: deep learning-based recombination spots prediction by incorporating secondary sequence information coupled with physio-chemical properties via Chou's 5-step rule and pseudo components, Chemometr. Intell. Lab. Syst., 189, 169, 10.1016/j.chemolab.2019.05.003 Sikander, 2022, XGB-DrugPred: computational prediction of druggable proteins using eXtreme gradient boosting and optimized features set, Sci. Rep., 12, 1, 10.1038/s41598-022-09484-3 Ali, 2016, Machine learning approaches for discrimination of Extracellular Matrix proteins using hybrid feature space, J. Theor. Biol., 403, 30, 10.1016/j.jtbi.2016.05.011 Chen, 2016, Xgboost: a scalable tree boosting system, 785 Khan, 2021, 1 Wang, 2010, Predicting subcellular localization of gram-negative bacterial proteins by linear dimensionality reduction method, Protein Pept. Lett., 17, 32, 10.2174/092986610789909494 Liu, 2014, iDNA-Prot| dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition, PLoS One, 9 Hu, 2019, TargetDBP: accurate DNA-binding protein prediction via sequence-based multi-view feature learning, IEEE ACM Trans. Comput. Biol. Bioinf, 17, 1419 Du, 2019, MsDBP: exploring DNA-binding proteins by integrating multiscale sequence information via Chou's five-step rule, J. Proteome Res., 18, 3119, 10.1021/acs.jproteome.9b00226 Zhao, 2022, Identify DNA-binding proteins through the extreme gradient boosting algorithm, Front. Genet., 12, 10.3389/fgene.2021.821996