Recognizing ion ligand binding sites by SMO algorithm
Tóm tắt
In many important life activities, the execution of protein function depends on the interaction between proteins and ligands. As an important protein binding ligand, the identification of the binding site of the ion ligands plays an important role in the study of the protein function. In this study, four acid radical ion ligands (NO2−,CO32−,SO42−,PO43−) and ten metal ion ligands (Zn2+,Cu2+,Fe2+,Fe3+,Ca2+,Mg2+,Mn2+,Na+,K+,Co2+) are selected as the research object, and the Sequential minimal optimization (SMO) algorithm based on sequence information was proposed, better prediction results were obtained by 5-fold cross validation. An efficient method for predicting ion ligand binding sites was presented.
Tài liệu tham khảo
Leustek T, Murillo M, Cervantes M. Cloning of a cDNA encoding ATP sulfurylase from Arabidopsis thaliana by functional expression in Saccharomyces cerevisiae [J]. Plant Physiol. 1994;105(3):897–902.
Monigatti F, Gasteiger E, Bairoch A, et al. The Sulfinator: predicting tyrosine sulfation sites in protein sequences [J]. Bioinformatics. 2002;18(5):769–70.
Hatzfeld Y, Lee S, Lee M, et al. Functional characterization of a gene encoding a fourth ATP sulfurylase isoform from Arabidopsis thaliana [J]. Gene. 2000;248(1):51–8.
Lv X, Tan X. Metals homeostasis and related proteins in Alzheimer's disease [J]. Progress in Chemistry. 2013;25(4):511–9.
Bao W, Jiang Z, Huang DS. Novel human microbe-disease association prediction using network consistency projection [J]. BMC Bioinformatics. 2017;18(S16):543.
Deng SP, Cao S, Huang DS, et al. Identifying stages of kidney renal cell carcinoma by combining gene expression and DNA methylation data [J]. IEEE/ACM Trans Comput Biol Bioinform. 2017;14(5):1147–53.
Guo W, Zhu L, Deng S, et al. Understanding tissue-specificity with human tissue-specific regulatory networks [J]. Science China Inf Sci. 2016;59(7):070105.
Deng SP, Zhu L, Huang DS. Predicting Hub Genes Associated with Cervical Cancer through Gene Co-Expression Networks[J]. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2016:13(1):27–35.
Deng SP, Zhu L, Huang DS. Mining the bladder cancer-associated genes by an integrated strategy for the construction and analysis of differential co-expression networks [J]. BMC Genomics. 2015;16(3 Supplement):S4.
Huang DS, Zheng CH. Independent component analysis-based penalized discriminant method for tumor classification using gene expression data [J]. Bioinformatics. 2006;22(15):1855–62.
Warner RG, Hundt C, Weiss S, et al. Identification of the heparan sulfate binding sites in the cellular prion protein [J]. J Biol Chem. 2002;277(21):18421–30.
Li S, Hu X, et al. Identifying the sulfate ion binding residues in proteins [J]. International Conference on Biomedical & Biological Engineering, 2017.
Yang J, Roy A, Zhang Y. BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions [J]. Nucleic Acids Res. 2013;41(Database issue):1096–103.
Sobolev V, Edelman M. Web tools for predicting metal binding sites in proteins [J]. Israel J Chemistry. 2013;53(3–4):166–72.
Lu CH, Lin YF, Lin JJ, et al. Prediction of metal ion–binding sites in proteins using the fragment transformation method [J]. PLoS One. 2012;7(6):e39252.
Hu X, Wang K, Dong Q. Protein ligand-specific binding residue predictions by an ensemble classifier [J]. BMC Bioinformatics. 2016;17(1):470.
Cao X, Hu X, Zhang X, et al. Identification of metal ion binding sites based on amino acid sequences [J]. PLoS One. 2017;12(8):e0183756.
Greenside P, Hillenmeyer M, Kundaje A. Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures. In: Pacific symposium; 2018.
Liu T, Lin Y, Wen X, et al. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities [J]. Nucleic Acids Res. 2007;35(Database issue):198–201.
Panek J, Eidhammer IR. A new method for identification of protein (sub) families in a set of proteins based on hydropathy distribution in proteins [J]. Proteins-structure Funct Bioinformatics. 2005;58(4):923–34.
Taylor WR. The classification of amino acid conservation.[J]. J Theor Biol. 1986;119(2):205–18.
Chen H. Prediction of solvent accessibility and sites of deleterious mutations from protein sequence [J]. Nucleic Acids Res. 2005;33(10):3193–9.
Wu S, Zhang Y. ANGLOR: a composite machine-learning algorithm for protein backbone torsion angle prediction [J]. PLoS One. 2008;3(10):e3400.
Kel AE. E. Gößling, Reuter I, et al. MATCHTM: a tool for searching transcription factor binding sites in DNA sequences [J]. Nucleic Acids Res. 2003;31(13):3576–9.
Hu X, Li Q. Using support vector machine to predict - and -turns in proteins [J]. J Comput Chem. 2010;29(12):1867–75.
Zhenxing F, Xiuzhen H. Recognition of 27-class protein folds by adding the interaction of segments and motif information [J]. Biomed Res Int. 2014;2014:1–9.
Lei L, Xiuzhen H. Predicting protein fold types by the general form of Chou’s pseudo amino acid composition: approached from optimal feature extractions [J]. Protein Pept Lett. 2012;19:439–49.
Feng ZX, Li QZ. Recognition of long-range enhancer-promoter interactions by adding genomic signatures of segmented regulatory regions [J]. Genomics. 2017;109(5–6):341.
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The WEKA data mining software: an update. ACM SIGKDD Explor Newsl. 2009;11:10–8.
Üstün B, Melssen W, Buydens L, et al. Facilitating the application of support vector regression by using a universal Pearson VII function based kernel [J]. Chemometrics Intell Lab Syst. 2006;81(1):29–40.
Sun T, Zhou B, Lai L, et al. Sequence-based prediction of protein protein interaction using a deep-learning algorithm [J]. Bioinformatics. 2017;18(1):277.
Jiang Z, Hu XZ, Geriletu G, et al. Identification of Ca2+-binding residues of a protein from its primary sequence [J]. Genet Mol Res. 2016;15(2):gmr.15027618.
Hu X, Dong Q, Yang J, et al. Recognizing metal and acid radical ion-binding sites by integrating ab initio modeling with template-based transferals [J]. Bioinformatics. 2016;32(21):3260.
Tao W, Liping L, Yu-An H, et al. Prediction of Protein-Protein Interactions from Amino Acid Sequences Based on Continuous and Discrete Wavelet Transform Features[J]. Molecules. 2018:23(4):823–37.
Yi HC, You ZH, Huang DS, et al. A Deep Learning Framework for Robust and Accurate prediction of ncRNA-Protein Interactions using Evolutionary Information[J]. Mol Ther - Nucleic Acids. 2018:11:337–44.