Identification of cavities on protein surface using multiple computational approaches for drug binding site prediction

Bioinformatics - Tập 27 Số 15 - Trang 2083-2088 - 2011
Zengming Zhang1, Yu Li1, Biaoyang Lin1, Michael Schroeder1, Bingding Huang1
11 Systems Biology Division, Zhejiang-California International NanoSystems Institute, Zhejiang University, 310029 Hangzhou, China and 2Bioinformatics Group, Biotechnology Center, Technical University of Dresden, 01307, Dresden, Germany

Tóm tắt

Abstract Motivation: Protein–ligand binding sites are the active sites on protein surface that perform protein functions. Thus, the identification of those binding sites is often the first step to study protein functions and structure-based drug design. There are many computational algorithms and tools developed in recent decades, such as LIGSITEcs/c, PASS, Q-SiteFinder, SURFNET, and so on. In our previous work, MetaPocket, we have proved that it is possible to combine the results of many methods together to improve the prediction result. Results: Here, we continue our previous work by adding four more methods Fpocket, GHECOM, ConCavity and POCASA to further improve the prediction success rate. The new method MetaPocket 2.0 and the individual approaches are all tested on two datasets of 48 unbound/bound and 210 bound structures as used before. The results show that the average success rate has been raised 5% at the top 1 prediction compared with previous work. Moreover, we construct a non-redundant dataset of drug–target complexes with known structure from DrugBank, DrugPort and PDB database and apply MetaPocket 2.0 to this dataset to predict drug binding sites. As a result, >74% drug binding sites on protein target are correctly identified at the top 3 prediction, and it is 12% better than the best individual approach. Availability: The web service of MetaPocket 2.0 and all the test datasets are freely available at http://projects.biotec.tu-dresden.de/metapocket/ and http://sysbio.zju.edu.cn/metapocket. Contact:  [email protected] Supplementary Information:  Supplementary data are available at Bioinformatics online.

Từ khóa


Tài liệu tham khảo

Binkowski, 2003, CASTp: Computed Atlas of Surface Topography of proteins, Nucleic Acids Res., 31, 3352, 10.1093/nar/gkg512

Brady, 2000, Fast prediction and visualization of protein binding pockets with PASS, J. Comput. Aided Mol. Des., 14, 383, 10.1023/A:1008124202956

Capra, 2009, Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure, PLoS Comput. Biol., 5, e1000585, 10.1371/journal.pcbi.1000585

Cheng, 2007, Structure-based maximal affinity model predicts small-molecule druggability, Nat. Biotechnol., 25, 71, 10.1038/nbt1273

Dundas, 2006, CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues, Nucleic Acids Res., 34, W116, 10.1093/nar/gkl282

Edelsbrunner, 1994, Three-dimensional alpha shapes, ACM Trans. Graph., 13, 43, 10.1145/174462.156635

Ghersi, 2009, EasyMIFS and SiteHound: a toolkit for the identification of ligand-binding sites in protein structures, Bioinformatics, 25, 3185, 10.1093/bioinformatics/btp562

Hajduk, 2005, Druggability indices for protein targets derived from NMR-based screening data, J. Med. Chem., 48, 2518, 10.1021/jm049131r

Hajduk, 2005, Predicting protein druggability, Drug Discov. Today, 10, 1675, 10.1016/S1359-6446(05)03624-X

Hendlich, 1997, LIGSITE: automatic and efficient detection of potential small molecule-binding sites in proteins, J. Mol. Graph. Model., 15, 359, 10.1016/S1093-3263(98)00002-3

Hernandez, 2009, SITEHOUND-web: a server for ligand binding site identification in protein structures, Nucleic Acids Res., 37, W413, 10.1093/nar/gkp281

Huang, 2009, MetaPocket: a meta approach to improve protein ligand binding site prediction, OMICS, 13, 325, 10.1089/omi.2009.0045

Huang, 2006, LIGSITEcsc: predicting ligand binding sites using the Connolly surface and degree of conservation, BMC Struct. Biol., 6, 19, 10.1186/1472-6807-6-19

Huang, 2010, CD-HIT Suite: a web server for clustering and comparing biological sequences, Bioinformatics, 26, 680, 10.1093/bioinformatics/btq003

Kawabata, 2010, Detection of multiscale pockets on protein surfaces using mathematical morphology, Proteins, 78, 1195, 10.1002/prot.22639

Kawabata, 2007, Detection of pockets on protein surfaces using small and large probe spheres to find putative ligand binding sites, Proteins, 68, 516, 10.1002/prot.21283

Laskowski, 1995, SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions, J. Mol. Graph., 13, 323, 10.1016/0263-7855(95)00073-9

Laurie, 2005, Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites, Bioinformatics, 21, 1908, 10.1093/bioinformatics/bti315

Le Guilloux, 2009, Fpocket: an open source platform for ligand pocket detection, BMC Bioinformatics, 10, 168, 10.1186/1471-2105-10-168

Li, 2006, Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences, Bioinformatics, 22, 1658, 10.1093/bioinformatics/btl158

Nayal, 2006, On the nature of cavities on protein surfaces: application to the identification of drug-binding sites, Proteins, 63, 892, 10.1002/prot.20897

Schmidtke, 2010, Understanding and predicting druggability. A high-throughput method for detection of drug binding sites, J. Med. Chem., 53, 5858, 10.1021/jm100574m

Sugaya, 2009, Assessing the druggability of protein-protein interactions by a supervised machine-learning method, BMC Bioinformatics, 10, 263, 10.1186/1471-2105-10-263

Weisel, 2007, PocketPicker: analysis of ligand binding-sites with shape descriptors, Chem. Cent. J., 1, 7, 10.1186/1752-153X-1-7

Wishart, 2006, DrugBank: a comprehensive resource for in silico drug discovery and exploration, Nucleic Acids Res., 34, D668, 10.1093/nar/gkj067

Wishart, 2008, DrugBank: a knowledgebase for drugs, drug actions and drug targets, Nucleic Acids Res., 36, D901, 10.1093/nar/gkm958

Yu, 2010, Roll: a new algorithm for the detection of protein pockets and cavities with a rolling probe sphere, Bioinformatics, 26, 46, 10.1093/bioinformatics/btp599