The utility of geometrical and chemical restraint information extracted from predicted ligand-binding sites in protein structure refinement

Journal of Structural Biology - Tập 173 - Trang 558-569 - 2011
Michal Brylinski1, Seung Yup Lee1, Hongyi Zhou1, Jeffrey Skolnick1
1Center for the Study of Systems Biology, Georgia Institute of Technology, Atlanta, GA 30318, United States

Tài liệu tham khảo

Armistead, 1995, Design, synthesis and structure of non-macrocyclic inhibitors of FKBP12, the major binding protein for the immunosuppressant FK506, Acta Crystallogr. D: Biol. Crystallogr., 51, 522, 10.1107/S0907444994014502 Ashburner, 2000, Gene ontology: tool for the unification of biology. The Gene Ontology Consortium, Nat. Genet., 25, 25, 10.1038/75556 Aury, 2008, High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies, BMC Genomics, 9, 603, 10.1186/1471-2164-9-603 Berman, 2000, The Protein Data Bank, Nucleic Acids Res., 28, 235, 10.1093/nar/28.1.235 Turlach, 1993 Bindewald, 2005, A scoring function for docking ligands to low-resolution protein structures, J. Comput. Chem., 26, 374, 10.1002/jcc.20175 Brylinski, 2008, A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation, Proc. Natl. Acad. Sci. USA, 105, 129, 10.1073/pnas.0707684105 Brylinski, 2008, Q-Dock: low-resolution flexible ligand docking with pocket-specific threading restraints, J. Comput. Chem., 29, 1574, 10.1002/jcc.20917 Brylinski, 2009, Comparison of structure-based and threading-based approaches to protein functional annotation, Proteins, 78, 18, 10.1002/prot.22566 Brylinski, 2009, FINDSITE(LHM): a threading-based approach to ligand homology modeling, PLoS Comput. Biol., 5, e1000405, 10.1371/journal.pcbi.1000405 Brylinski, 2010, Q-Dock (LHM): low-resolution refinement for ligand comparative modeling, J. Comput. Chem., 31, 1093 Butcher, 2004, Systems biology in drug discovery, Nat. Biotechnol., 22, 1253, 10.1038/nbt1017 Chang, C.-C., Lin, C.-J., 2001. LIBSVM: a library for support vector machines Software available at <http://www.csie.ntu.edu.tw/≃cjlin/libsvm>. Chelliah, 2008, Functional site prediction selects correct protein models, BMC Bioinformatics, 1, S13, 10.1186/1471-2105-9-S1-S13 Chothia, 1986, The relation between the divergence of sequence and structure in proteins, EMBO J., 5, 823, 10.1002/j.1460-2075.1986.tb04288.x Cortes, 1995, Support-vector networks, Mach. Learn., 20, 273, 10.1007/BF00994018 Damm, 2006, Gaussian-weighted RMSD superposition of proteins: a structural comparison for flexible proteins and predicted protein structures, Biophys. J., 90, 4558, 10.1529/biophysj.105.066654 Davis, 2009, RosettaLigand docking with full ligand and receptor flexibility, J. Mol. Biol., 385, 381, 10.1016/j.jmb.2008.11.010 DeWeese-Scott, 2004, Molecular modeling of protein function regions, Proteins, 55, 942, 10.1002/prot.10519 Drucker, 1997 Dubowchik, 2001, 2-Aryl-2, 2-difluoroacetamide FKBP12 ligands: synthesis and X-ray structural studies, Org. Lett., 3, 3987, 10.1021/ol0166909 Dunbrack, 1993, Backbone-dependent rotamer library for proteins. Application to side-chain prediction, J. Mol. Biol., 230, 543, 10.1006/jmbi.1993.1170 Erickson, 2004, Lessons in molecular recognition: the effects of ligand and protein flexibility on molecular docking accuracy, J. Med. Chem., 47, 45, 10.1021/jm030209y Evers, 2003, Ligand-supported homology modelling of protein binding-sites using knowledge-based potentials, J. Mol. Biol., 334, 327, 10.1016/j.jmb.2003.09.032 Fan, 2004, Refinement of homology-based protein structures by molecular dynamics simulation techniques, Protein Sci., 13, 211, 10.1110/ps.03381404 Fan, 2009, Molecular docking screens using comparative models of proteins, J. Chem. Inf. Model., 49, 2512, 10.1021/ci9003706 Fiser, 2004, Protein structure modeling in the proteomics era, Expert Rev. Proteomics, 1, 97, 10.1586/14789450.1.1.97 Galat, 1993, Peptidylproline cis–trans-isomerases: immunophilins, Eur. J. Biochem., 216, 689, 10.1111/j.1432-1033.1993.tb18189.x Gao, 2009, From nonspecific DNA–protein encounter complexes to the prediction of DNA–protein interactions, PLoS Comput. Biol., 5, e1000341, 10.1371/journal.pcbi.1000341 Goodsell, 1996, Automated docking of flexible ligands: applications of AutoDock, J. Mol. Recognit., 9, 1, 10.1002/(SICI)1099-1352(199601)9:1<1::AID-JMR241>3.0.CO;2-6 Gopal, 2001, Homology-based annotation yields 1, 042 new candidate genes in the Drosophila melanogaster genome, Nat. Genet., 27, 337, 10.1038/85922 Hattori, 2003, Heuristics for chemical compound matching, Genome Inform., 14, 144 Heringa, 1999, Strain in protein structures as viewed through nonrotameric side chains: II. Effects upon ligand binding, Proteins, 37, 44, 10.1002/(SICI)1097-0134(19991001)37:1<44::AID-PROT5>3.0.CO;2-F Huang, 2006, Molecular mechanics methods for predicting protein–ligand binding, Phys. Chem. Chem. Phys., 8, 5166, 10.1039/B608269F Irwin, 2005, ZINC – a free database of commercially available compounds for virtual screening, J. Chem. Inf. Model., 45, 177, 10.1021/ci049714+ Jones, 2000, Threading methods for protein structure prediction, 1 Jones, 1996, A brief survey of bandwidth selection for density estimation, J. Amer. Stat. Assoc., 91, 401, 10.1080/01621459.1996.10476701 Juncker, 2009, Sequence-based feature prediction and annotation of proteins, Genome Biol., 10, 206, 10.1186/gb-2009-10-2-206 Karypis, G., 2003. CLUTO: A Clustering Toolkit, 2.1.1 ed. Kauffman, 2008, Improving homology models for protein–ligand binding sites, Comput. Syst. Bioinformatics Conf., 7, 211, 10.1142/9781848162648_0019 Kmiecik, 2007, Towards the high-resolution protein structure prediction. Fast refinement of reduced models with all-atom force field, BMC Struct. Biol., 7, 43, 10.1186/1472-6807-7-43 Koehl, 1994, Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy, J. Mol. Biol., 239, 249, 10.1006/jmbi.1994.1366 Kryshtafovych, 2005, Progress over the first decade of CASP experiments, Proteins, 61, 225, 10.1002/prot.20740 Levitt, 1997, Protein folding: the endgame, Annu. Rev. Biochem., 66, 549, 10.1146/annurev.biochem.66.1.549 Liang, 1998, Anatomy of protein pockets and cavities: measurement of binding site geometry and implications for ligand design, Protein Sci., 7, 1884, 10.1002/pro.5560070905 Loewenstein, 2009, Protein function annotation by homology-based inference, Genome Biol., 10, 207, 10.1186/gb-2009-10-2-207 Lyskov, 2008, The RosettaDock server for local protein–protein docking, Nucleic Acids Res., 36, W233, 10.1093/nar/gkn216 Mendes, 2001, Incorporating knowledge-based biases into an energy-based side-chain modeling method: application to comparative modeling of protein structure, Biopolymers, 59, 72, 10.1002/1097-0282(200108)59:2<72::AID-BIP1007>3.0.CO;2-S Moult, 2009, Critical assessment of methods of protein structure prediction – round VIII, Proteins, 77, 1, 10.1002/prot.22589 Moult, 2007, Critical assessment of methods of protein structure prediction – Round VII, Proteins, 69, 3, 10.1002/prot.21767 Moustakas, 2006, Development and validation of a modular, extensible docking program: DOCK 5, J. Comput. Aided Mol. Des., 20, 601, 10.1007/s10822-006-9060-4 Murray, 1999, The sensitivity of the results of molecular docking to induced fit effects: application to thrombin, thermolysin and neuraminidase, J. Comput. Aided Mol. Des., 13, 547, 10.1023/A:1008015827877 Najmanovich, 2000, Side-chain flexibility in proteins upon ligand binding, Proteins, 39, 261, 10.1002/(SICI)1097-0134(20000515)39:3<261::AID-PROT90>3.0.CO;2-4 O’Toole, 2003, Coverage of protein sequence space by current structural genomics targets, J. Struct. Funct. Genomics, 4, 47, 10.1023/A:1026156025612 Pandit, 2008, Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score, BMC Bioinformatics, 9, 531, 10.1186/1471-2105-9-531 Panjkovich, 2010, Assessing the structural conservation of protein pockets to study functional and allosteric sites: implications for drug discovery, BMC Struct. Biol., 10, 9, 10.1186/1472-6807-10-9 Parzen, 1962, On estimation of a probability density function and mode, Ann. Math. Stat., 33, 1065, 10.1214/aoms/1177704472 Pencheva, 2008, AMMOS: Automated Molecular Mechanics Optimization tool for in silico Screening, BMC Bioinformatics, 9, 438, 10.1186/1471-2105-9-438 Piedra, 2008, Preservation of protein clefts in comparative models, BMC Struct. Biol., 8, 2, 10.1186/1472-6807-8-2 Pils, 2005, Variation in structural location and amino acid conservation of functional sites in protein domain families, BMC Bioinformatics, 6, 210, 10.1186/1471-2105-6-210 Rahman, 2009, Small Molecule Subgraph Detector (SMSD) toolkit, J. Cheminform., 1, 12, 10.1186/1758-2946-1-12 Rajamani, 2007, Ranking poses in structure-based lead discovery and optimization: current trends in scoring function development, Curr. Opin. Drug Discov. Dev., 10, 308 Rost, 2003, Automatic prediction of protein function, Cell Mol. Life Sci., 60, 2637, 10.1007/s00018-003-3114-8 Rotkiewicz, 2008, Fast procedure for reconstruction of full-atom protein models from reduced representations, J. Comput. Chem., 29, 1460, 10.1002/jcc.20906 Sali, 1993, Comparative protein modelling by satisfaction of spatial restraints, J. Mol. Biol., 234, 779, 10.1006/jmbi.1993.1626 Seifert, 2007, Virtual high-throughput screening of molecular databases, Curr. Opin. Drug Discov. Dev., 10, 298 Skolnick, 2001, Defrosting the frozen approximation: PROSPECTOR – a new approach to threading, Proteins, 42, 319, 10.1002/1097-0134(20010215)42:3<319::AID-PROT30>3.0.CO;2-A Skolnick, 2009, FINDSITE: a combined evolution/structure-based approach to protein function prediction, Brief Bioinformatics, 10, 378, 10.1093/bib/bbp017 Skolnick, 2004, Development and large scale benchmark testing of the PROSPECTOR_3 threading algorithm, Proteins, 56, 502, 10.1002/prot.20106 Sutherland, 2007, Lessons in molecular recognition. 2. Assessing and improving cross-docking accuracy, J. Chem. Inf. Model., 47, 2293, 10.1021/ci700253h Tanimoto, T.T., 1958. An Elementary Mathematical Theory of Classification and Prediction. IBM Internal Report. Teodoro, 2003, Conformational flexibility models for the receptor in structure based drug design, Curr. Pharm. Des., 9, 1635, 10.2174/1381612033454595 Tettelin, 2009, Bacterial genome sequencing, Meth. Mol. Biol., 551, 231, 10.1007/978-1-60327-999-4_18 Vakser, 1996, Low-resolution docking: prediction of complexes for underdetermined structures, Biopolymers, 39, 455, 10.1002/(SICI)1097-0282(199609)39:3<455::AID-BIP16>3.3.CO;2-8 van Dijk, 2008, A protein–DNA docking benchmark, Nucleic Acids Res., 36, e88, 10.1093/nar/gkn386 Voigt, 2001, Comparison of the NCI open database with seven large chemical structural databases, J. Chem. Inf. Comput. Sci., 41, 702, 10.1021/ci000150t Wallach, 2009, The protein-small-molecule database, a non-redundant structural resource for the analysis of protein–ligand binding, Bioinformatics, 25, 615, 10.1093/bioinformatics/btp035 Weisel, 2009, Form follows function: shape analysis of protein cavities for receptor-based drug design, Proteomics, 9, 451, 10.1002/pmic.200800092 Wheeler, 2008, The complete genome of an individual by massively parallel DNA sequencing, Nature, 452, 872, 10.1038/nature06884 Wiehe, 2008, Protein–protein docking: overview and performance analysis, Meth. Mol. Biol., 413, 283 Wilson, 1993, Modeling side-chain conformation for homologous proteins using an energy-based rotamer search, J. Mol. Biol., 229, 996, 10.1006/jmbi.1993.1100 Wojciechowski, 2002, Docking of small ligands to low-resolution and theoretically predicted receptor structures, J. Comput. Chem., 23, 189, 10.1002/jcc.1165 Wroblewska, 2008, Development of a physics-based force field for the scoring and refinement of protein models, Biophys. J., 94, 3227, 10.1529/biophysj.107.121947 Wu, 2003, Detailed analysis of grid-based molecular docking: A case study of CDOCKER-A CHARMm-based MD docking algorithm, J. Comput. Chem., 24, 1549, 10.1002/jcc.10306 Xie, 2005, Functional coverage of the human genome by existing structures, structural genomics targets, and homology models, PLoS Comput. Biol., 1, e31, 10.1371/journal.pcbi.0010031 Xue, 2003, Profile scaling increases the similarity search performance of molecular fingerprints containing numerical descriptors and structural keys, J. Chem. Inf. Comput. Sci., 43, 1218, 10.1021/ci030287u You, 2004, Toward computational systems biology, Cell Biochem. Biophys., 40, 167, 10.1385/CBB:40:2:167 Yura, 2006, Coverage of whole proteome by structural genomics observed through protein homology modeling database, J. Struct. Funct. Genomics, 7, 65, 10.1007/s10969-006-9010-3 Zhang, 2004, Automated structure prediction of weakly homologous proteins on a genomic scale, Proc. Natl. Acad. Sci. USA, 101, 7594, 10.1073/pnas.0305695101 Zhang, 2004, Scoring function for automated assessment of protein structure template quality, Proteins, 57, 702, 10.1002/prot.20264 Zhang, 2005, TM-align: a protein structure alignment algorithm based on the TM-score, Nucleic Acids Res., 33, 2302, 10.1093/nar/gki524 Zhang, 2005, The protein structure prediction problem could be solved using the current PDB library, Proc. Natl. Acad. Sci. USA, 102, 1029, 10.1073/pnas.0407152101 Zhou, 2004, Single-body residue-level knowledge-based energy score combined with sequence-profile and secondary structure information for fold recognition, Proteins, 55, 1005, 10.1002/prot.20007 Zhou, 2005, Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments, Proteins, 58, 321, 10.1002/prot.20308 Zhou, 2007, Ab initio protein structure prediction using chunk-TASSER, Biophys. J., 93, 1510, 10.1529/biophysj.107.109959