A protein sequence meta-functional signature for calcium binding residue prediction

Pattern Recognition Letters - Tập 31 - Trang 2103-2112 - 2010
Jeremy A. Horst, Ram Samudrala

Tài liệu tham khảo

Abagyan, 2009, The flexible pocketome engine for structural chemogenomics, Methods Mol. Biol., 575, 249, 10.1007/978-1-60761-274-2_11 Altschul, 1997, Gapped BLAST and PSI-BLAST: A new generation of protein database search programs, Nucleic Acids Res., 25, 3389, 10.1093/nar/25.17.3389 Ashworth, 2006, Computational redesign of endonuclease DNA binding and cleavage specificity, Nature, 441, 656, 10.1038/nature04818 Berman, 2007, The worldwide Protein Data Bank (wwPDB): Ensuring a single, uniform archive of PDB data, Nucleic Acids Res., 35, D301, 10.1093/nar/gkl971 Biegert, 2009, Sequence context-specific profiles for homology searching, Proc. Natl. Acad. Sci. USA, 106, 3770, 10.1073/pnas.0810767106 Bork, 1998, Predicting function: From genes to genomes and back, J. Mol. Biol., 283, 707, 10.1006/jmbi.1998.2144 Chen, 2004, TargetDB: A target registration database for structural genomics projects, Bioinformatics, 20, 2860, 10.1093/bioinformatics/bth300 Cheng, 2005, Improvement in protein functional site prediction by distinguishing structural and functional constraints on protein family evolution using computational design, Nucleic Acids Res., 33, 5861, 10.1093/nar/gki894 Deng, 2006, Predicting calcium-binding sites in proteins – A graph theory and geometry approach, Proteins, 64, 34, 10.1002/prot.20973 Eddy, 1998, Profile hidden Markov models, Bioinformatics, 14, 755, 10.1093/bioinformatics/14.9.755 Edgar, 2004, MUSCLE: Multiple sequence alignment with high accuracy and high throughput, Nucleic Acids Res., 32, 1792, 10.1093/nar/gkh340 Felsenstein, 1981, Evolutionary trees from DNA sequences: A maximum likelihood approach, J. Mol. Evol., 17, 368, 10.1007/BF01734359 Fetrow, 1998, Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases, J. Mol. Biol., 281, 949, 10.1006/jmbi.1998.1993 Fischer, 2008, Prediction of protein functional residues from sequence by probability density estimation, Bioinformatics, 24, 613, 10.1093/bioinformatics/btm626 Fleming, 2006, The proteome: Structure, function and evolution, Philos. Trans. Roy. Soc. Lond. B – Biol. Sci., 29, 441, 10.1098/rstb.2005.1802 Ge, 2003, Integrating ‘omic’ information: A bridge between genomics and systems biology, Trends Genet., 19, 551, 10.1016/j.tig.2003.08.009 Gutteridge, 2005, Understanding nature’s catalytic toolkit, Trends Biochem. Sci., 30, 622, 10.1016/j.tibs.2005.09.006 Horst, J.A., Samudrala, R., 2009. Diversity of protein structures and difficulties in fold recognition: The curious case of protein G. F1000 Biology Reports, vol. 1, p. 69. Jensen, 1974, Enzyme recruitment in evolution of new function, Annu. Rev. Microbiol., 30, 409, 10.1146/annurev.mi.30.100176.002205 Jiang, 2008, De novo computational design of retro-aldol enzymes, Science, 319, 1387, 10.1126/science.1152692 Jones, 1999, Protein secondary structure prediction based on position-specific scoring matrices, J. Mol. Biol., 292, 195, 10.1006/jmbi.1999.3091 Khersonsky, 2006, Enzyme promiscuity: Evolutionary and mechanistic aspects, Curr. Opin. Chem. Biol., 10, 498, 10.1016/j.cbpa.2006.08.011 Laskowski, 2005, Protein function prediction using local 3D templates, J. Mol. Biol., 351, 614, 10.1016/j.jmb.2005.05.067 Lee, 2007, Endocrine regulation of energy metabolism by the skeleton, Cell, 130, 456, 10.1016/j.cell.2007.05.047 Lopez, 2009, Assessment of ligand binding residue predictions in CASP8, Proteins, 77, 138, 10.1002/prot.22557 Margelevicius, 2005, PSI-BLAST-ISS: An intermediate sequence search tool for estimation of the position-specific alignment reliability, BMC Bioinform., 6, 185, 10.1186/1471-2105-6-185 McDermott, 2005, Functional annotation from predicted protein interaction networks, Bioinformatics, 21, 3217, 10.1093/bioinformatics/bti514 Mihalek, 2004, A family of evolution – Entropy hybrid methods for ranking protein residues by importance, J. Mol. Biol., 336, 1265, 10.1016/j.jmb.2003.12.078 Moult, 2005, A decade of CASP: Progress, bottlenecks and prognosis in protein structure prediction, Curr. Opin. Struct. Biol., 15, 285, 10.1016/j.sbi.2005.05.011 O’Day, 2003, CaMBOT: Profiling and characterizing calmodulin-binding proteins, Cell. Signal., 15, 347, 10.1016/S0898-6568(02)00116-X Protein Data Bank. Research Collaboratory for Structural Bioinformatics. <http://www.pdb.org> (accessed 17.07.09). Protein Structure Initiative. Structural Genomics Knowledgebase: TargetDB Statistics Summary Report. <http://targetdb.pdb.org/statistics/TargetStatistics.html> (accessed 11.11.09). Pruitt, 2005, NCBI Reference Sequence (RefSeq): A curated non-redundant sequence database of genomes, transcripts and proteins, Nucl Acids Res., 33, D501, 10.1093/nar/gki025 Raman, 2009, Structure prediction for CASP8 with all-atom refinement using Rosetta, Proteins, 77, 89, 10.1002/prot.22540 Reeves, 2009, Genome and proteome annotation: Organization, interpretation and integration, J. Roy. Soc. Interface, 6, 129, 10.1098/rsif.2008.0341 Shoemaker, 2000, Speeding molecular recognition by using the folding funnel: The fly-casting mechanism, Proc. Natl. Acad. Sci. USA, 97, 8868, 10.1073/pnas.160259697 Sterner, 2007, Predicting and annotating catalytic residues: An information theoretic approach, J. Comput. Biol., 14, 1058, 10.1089/cmb.2007.0042 Stone, 2005, Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity, Genome Res., 15, 978, 10.1101/gr.3804205 Tordoff, 2001, Calcium: Taste, intake, and appetite, Physiol. Rev., 81, 1567, 10.1152/physrev.2001.81.4.1567 US Department of Energy Joint Genome Institute: Intergrated Microbial Genomes. <http://img.jgi.doe.gov> (accessed 18.11.09). Wang, 2006, Incorporating background frequency improves entropy-based residue conservation measures, BMC Bioinform., 7, 385, 10.1186/1471-2105-7-385 Wang, 2008, Protein meta-functional signatures from combining sequence, structure, evolution and amino acid property information, PLoS Comput. Biol., 4, e1000181, 10.1371/journal.pcbi.1000181 Wang, 2009, Towards predicting Ca2+-binding sites with different coordination numbers in proteins with atomic resolution, Proteins, 75, 787, 10.1002/prot.22285 Zhang, 2008, Progress and challenges in protein structure prediction, Curr. Opin. Struct. Biol., 18, 342, 10.1016/j.sbi.2008.02.004 Zhang, 2008, Accurate sequence-based prediction of catalytic residues, Bioinformatics, 24, 2329, 10.1093/bioinformatics/btn433