Identifying the interacting positions of a protein using Boolean learning and support vector machines

Computational Biology and Chemistry - Tập 30 - Trang 268-279 - 2006
Anshul Dubey1, Matthew J. Realff1, Jay H. Lee1, Andreas S. Bommarius1,2,3
1School of Chemical and Biomolecular Engineering, 311 Ferst Drive, Atlanta, GA 30332, United States
2School of Chemistry and Biochemistry, Georgia Institute of Technology, United States
3Parker H. Petit Institute of Bioengineering and Bioscience, Georgia Institute of Technology, United States

Tài liệu tham khảo

Brown, 2000, Knowledge-based analysis of microarray gene expression data using support vector machines, Proc. Natl. Acad. Sci. U.S.A., 97, 262, 10.1073/pnas.97.1.262 Chen, 1993, Tuning the activity of an enzyme for unusual environments: sequential random mutagenesis of subtilisin e for catalysis in dimethylformamide, PNAS, 90, 5618, 10.1073/pnas.90.12.5618 Christianini, 2000 Cleland, 1996 Daugherty, 2000, Quantitative analysis of the effect of the mutation frequency on the affinity maturation of single chain fv antibodies, PNAS, 97, 2029, 10.1073/pnas.030527597 Deniz, 2003, Face recognition using independent component analysis and support vector machines, Pattern Recogn. Lett., 24, 2153, 10.1016/S0167-8655(03)00081-3 Deshpande, 1998, A greedy randomized adaptive search procedure (GRASP) for inferring logical clauses from examples in polynomial time and some extensions, Math. Comput. Model., 27, 75, 10.1016/S0895-7177(97)00255-0 Dubey, 2005, Support vector machines for learning to identify the critical positions of a protein, J. Theor. Biol., 234, 351, 10.1016/j.jtbi.2004.11.037 Efron, 1987, Better bootstrap confidence intervals, J. Am. Stat. Assoc., 82, 171, 10.2307/2289144 Efron, 1993 Fisher, 1966 Freund, 1997, A decision-theoretic generalization of on-line learning and an application to boosting, J. Comput. Syst. Sci., 55, 119, 10.1006/jcss.1997.1504 Huang, 1996, Amino acid sequence determinants of b-lactamase structure and activity, J. Mol. Biol., 258, 688, 10.1006/jmbi.1996.0279 Ikeuchi, 2003, Chimeric gene library construction by a simple and highly versatile method using recombination-dependent exponential amplification, Biotechnol. Prog., 19, 1460, 10.1021/bp034029t Jemwa, 2005, Improving process operations using support vector machines and decision trees, AIChE J., 51, 526, 10.1002/aic.10315 Kim, 2003, Protein secondary structure prediction based on an improved support vector machines approach, Protein Eng., 16, 553, 10.1093/protein/gzg072 Kuhlman, 2003, Design of a novel globular protein fold with atomic-level accuracy, Science, 302, 1364, 10.1126/science.1089427 Lin, 2002, Screening and selection methods for large-scale analysis of protein function, Angew. Chem. Int. Ed., 41, 4402, 10.1002/1521-3773(20021202)41:23<4402::AID-ANIE4402>3.0.CO;2-H Lunneborg, 2000 Meyer, 2003, Library analysis of SCHEMA-guided protein recombination, Protein Sci., 12, 1686, 10.1110/ps.0306603 Neylon, 2004, Chemical and biochemical strategies for the randomization of protein encoding DNA sequences: library construction methods for directed evolution, Nucleic Acids Res., 32, 1448, 10.1093/nar/gkh315 Petrounia, 2000, Designed evolution of enzymatic properties, Curr. Opin. Biotechnol., 11, 325, 10.1016/S0958-1669(00)00107-5 Platt, J.C., 1998. Sequential minimal optimization: a fast algorithm for training support vector machines, Microsoft Research, MSR-TR-98-14. Sanchez, 2003, Advanced support vector machines and kernel methods, Neurocomputing, 55, 5, 10.1016/S0925-2312(03)00373-4 Sanchez, 2002, An incremental learning algorithm for constructing Boolean functions from positive and negative examples, Comput. Oper. Res., 29, 1677, 10.1016/S0305-0548(01)00050-8 Saraf, 2003, Using a residue clash map to functionally characterize protein recombination hybrids, Protein Eng., 16, 1025, 10.1093/protein/gzg129 Saraf, 2004, FamClash: a method for ranking the activity of engineered enzymes, PNAS, 101, 4142, 10.1073/pnas.0400065101 Schapire, 1998, Boosting the margin: a new explanation for the effectiveness of voting methods, Ann. Stat., 26, 1651, 10.1214/aos/1024691352 Schneeweiss, 1989 Schölkopf, 2002 Soumillion, 2001, Novel concepts for selection of catalytic activity, Curr. Opin. Biotechnol., 12, 387, 10.1016/S0958-1669(00)00232-9 Stemmer, 1994, Rapid evolution of a protein in vitro by DNA shuffling, Nature, 370, 389, 10.1038/370389a0 Triantaphyllou, 1994, Inference of a minimum size Boolean function from examples by using a new efficient branch-and-bound approach, J. Global Optim., 5, 69, 10.1007/BF01097004 Triantaphyllou, 1996, An approach to guided learning of boolean functions, Math. Comput. Model., 23, 69, 10.1016/0895-7177(95)00234-0 Vapnik, 1995 Voet, 1995 Wahler, 2001, Novel methods for biocatalyst screening, Curr. Opin. Chem. Biol., 5, 152, 10.1016/S1367-5931(00)00184-8 Ward, 2003, Secondary structure prediction with support vector machines, Bioinformatics, 19, 1650, 10.1093/bioinformatics/btg223 Zhao, 1997, Combinatorial protein design: strategies for screening protein libraries, Curr. Opin. Biotechnol., 7, 480