Clustering Molecular Dynamics Trajectories: 1. Characterizing the Performance of Different Clustering Algorithms

Journal of Chemical Theory and Computation - Tập 3 Số 6 - Trang 2312-2334 - 2007
Jianyin Shao1, Stephen Tanner1, Nephi Thompson1, Thomas E. Cheatham1
1Departments of Medicinal Chemistry, Pharmaceutics and Pharmaceutical Chemistry, and Bioengineering, College of Pharmacy, University of Utah, 2000 East 30 South, Skaggs Hall 201, Salt Lake City, Utah 84112

Tóm tắt

Từ khóa


Tài liệu tham khảo

van Gunsteren W. F., 1982, Biochem. Soc. Trans., 10, 305, 10.1042/bst0100301

van Gunsteren W. F., 1982, Biochemistry, 21, 2274, 10.1021/bi00539a001

Kollman P. A., 2000, Acc. Chem. Res., 33, 897, 10.1021/ar000033j

van Gunsteren W. F., 2006, Angew. Chem., Int. Ed., 45, 4092, 10.1002/anie.200502655

Levitt M, 1983, J. Mol. Biol., 168, 617, 10.1016/S0022-2836(83)80305-2

Karplus M., 2002, Nat. Struct. Biol., 9, 652, 10.1038/nsb0902-646

Cheatham T. E., 2000, Ann. Rev. Phys. Chem., 51, 471

Duan Y., 1998, Science, 282, 744, 10.1126/science.282.5389.740

Hansson T., 2002, Curr. Opin. Struct. Biol., 12, 196, 10.1016/S0959-440X(02)00308-1

Tajkhorshid E., 2003, Adv. Protein Chem., 66, 247

Cheatham T. E., III, 2004, Curr. Opin. Struct. Biol., 14, 367

Feig M., 2004, Curr. Opin. Struct. Biol., 14, 224, 10.1016/j.sbi.2004.03.009

Wong C. F., 2003, Adv. Protein Chem., 66, 121

Rueda D., 2007, Proc. Natl. Acad. Sci., 104, 801, 10.1073/pnas.0605534104

Brooks C. I, 2002, Acc. Chem. Res., 35, 454, 10.1021/ar0100172

Daggett V, 2002, Acc. Chem. Res., 35, 449, 10.1021/ar0100834

Simmerling C., 2002, J. Am. Chem. Soc., 124, 11259, 10.1021/ja0273851

Pande V. S., 2003, Biopolymers, 68, 109, 10.1002/bip.10219

Wickstrom L., 2006, J. Mol. Biol., 360, 1107, 10.1016/j.jmb.2006.04.070

Day R., 2006, J. Mol. Biol., 366, 686

Juraszek J., 2006, Proc. Natl. Acad. Sci., 103, 15864, 10.1073/pnas.0606692103

Eleftheriou M., 2006, J. Am. Chem. Soc., 128, 13395, 10.1021/ja060972s

Yoda T., 2007, Proteins, 66, 859, 10.1002/prot.21264

Baumketner A., 2007, J. Mol. Biol., 366, 285, 10.1016/j.jmb.2006.11.015

Chen H. F., 2007, J. Am. Chem. Soc., 129, 2937

Paschek D., 2007, J. Struct. Biol., 157, 533, 10.1016/j.jsb.2006.10.031

Li W., 2007, Proteins, 67, 349

Periole X., 2007, J. Chem. Phys., 126, 014903, 10.1063/1.2404954

Scheraga H. A., 2007, Ann. Rev. Phys. Chem., 58, 83, 10.1146/annurev.physchem.58.032806.104614

Spackova N., 2003, J. Am. Chem. Soc., 125, 1769, 10.1021/ja025660d

Bui J. M., 2006, Proc. Natl. Acad. Sci., 103, 15456

Lu Y., 2006, J. Am. Chem. Soc., 128, 11839

Xu Y., 2006, Proteins, 64, 1068

de Jonge M. R., 2007, Proteins, 67, 980, 10.1002/prot.21376

Ode H., 2007, J. Med. Chem., 50, 1777

Hornak V., 2006, Proc. Natl. Acad. Sci., 103, 920, 10.1073/pnas.0508452103

Hornak V., 2006, J. Am. Chem. Soc., 128, 2813, 10.1021/ja058211x

Lankas F., 2006, Structure, 14, 1534, 10.1016/j.str.2006.08.004

Noy A., 2007, Nucl. Acids Res., 35, 3338

van der Vaart A., 2007, J. Chem. Phys., 126, 164106, 10.1063/1.2719697

Noe F., 2007, J. Chem. Phys., 126, 155102, 10.1063/1.2714539

Li D. W., 2007, J. Phys. Chem. B, 111, 5433

Patel S., 2007, J. Pept. Sci., 13, 326

Roccatano D., 2007, Biopolymers, 85, 421, 10.1002/bip.20690

Sefcikova J., 2007, Nucl. Acids Res., 35, 1946, 10.1093/nar/gkl1104

Razga F., 2006, Structure, 14, 835, 10.1016/j.str.2006.02.012

Kormos B. L., 2007, J. Struct. Biol., 157, 513, 10.1016/j.jsb.2006.10.022

Karpen M. E., 1993, Biochemistry, 32, 420, 10.1021/bi00053a005

Shenkin P. S., 1994, J. Comput. Chem., 15, 916, 10.1002/jcc.540150811

Cormack R. M, 1971, J. R. Stat. Soc. A, 134, 367

Jain A. K., 1999, ACM Comp. Surv., 31, 323

Torda A. E., 1994, J. Comput. Chem., 15, 1340, 10.1002/jcc.540151203

Marchionini C., 1983, Biochem. Biophys. Res. Comm., 112, 346, 10.1016/0006-291X(83)91836-3

Willett, P.Similarity and clustering in chemical information systems; John Wiley & Sons, Inc.  New York, 1987; Vol 1, p 266.

Kreissler M., 1989, J. Comput.-Aided Mol. Des., 3, 94, 10.1007/BF01590997

Unger R., 1989, Proteins, 5, 373, 10.1002/prot.340050410

Gordon H. L., 1992, Proteins, 14, 264, 10.1002/prot.340140211

Michel A., 1993, Comput. Chem., 17, 59, 10.1016/0097-8485(93)80028-C

Troyer J. M., 1995, Proteins, 23, 110, 10.1002/prot.340230111

Daura X., 1999, Proteins, 34, 280, 10.1002/(SICI)1097-0134(19990215)34:3<269::AID-PROT1>3.0.CO;2-3

Gabarro-Arpa J., 2000, Comput. Chem., 24, 698, 10.1016/S0097-8485(00)00067-X

Watts C. R., 2001, J. Biomol. Struct. Dyn., 18, 748, 10.1080/07391102.2001.10506703

Laboulais C., 2002, Proteins, 47, 179, 10.1002/prot.10081

Feher M., 2003, J. Chem. Inf. Comput. Sci., 43, 818

Bystroff C., 2003, Proteins, 50, 562, 10.1002/prot.10252

Moraitakis G., 2003, Biophys. J., 84, 2158, 10.1016/S0006-3495(03)75021-8

Lee M. C., 2005, Biophys. J., 88, 3146

Rao F., 2005, J. Chem. Phys., 122, 184901, 10.1063/1.1893753

Lyman E., 2006, Biophys. J., 91, 172, 10.1529/biophysj.106.082941

Sullivan D. C., 2006, J. Phys. Chem. B, 110, 16717

Li Y, 2006, J. Chem. Inf. Model., 46, 1750

Elmer S. P., 2004, J. Chem. Phys., 121, 12771, 10.1063/1.1812272

Sorin E. J., 2005, Biophys. J., 88, 2493

Sims G. E., 2005, Proc. Natl. Acad. Sci., 102, 621

Satoh D., 2006, FEBS Lett., 580, 3426, 10.1016/j.febslet.2006.05.015

Scott E. E., 2003, Proc. Natl. Acad. Sci., 100, 13201, 10.1073/pnas.2133986100

Poncin M., 1992, J. Mol. Biol., 226, 794, 10.1016/0022-2836(92)90632-T

Srinivasan J., 1998, J. Am. Chem. Soc., 120, 9409

Schlitter J, 1993, Chem. Phys. Lett., 215, 621, 10.1016/0009-2614(93)89366-P

Harris S. A., 2001, J. Am. Chem. Soc., 123, 12663, 10.1021/ja016233n

Fisher D., 1987, Improving inference through conceptual clustering, 465

Fisher D, 1987, Machine Learning, 2, 172

Cheeseman P., 1996, Advances in knowledge discovery and data mining, 83

Kohonen, T.Self-organizing maps, 3rd ed.; Springer:  Berlin-Heidelberg, 2001; Vol. 30, p 501.

Pearlman D. A., 1995, Comp. Phys. Comm., 91, 41, 10.1016/0010-4655(95)00041-D

Case D. A., 2005, J. Comput. Chem., 26, 1688

Guha, S.; Rastogi, R.; Shim, K. InCURE:  An efficient clusteringalgorithm for large databases; Proceedings of the ACM SIGMOD International Conference on Management of Data:  New York, 1998; pp 73−84.

Witten, I. H.; Frank, E.Data mining:  Practical machine learning toolsand techniques with Java implementations; Morgan Kaufmann:  1999; p 525.

Kohonen, T.Self-organization and Associative Memory; Springer-Verlag:  Berlin, 2001; Vol. 30, p 501.

Davies D. L., 1979, IEEE Trans. Pattern Anal. Mach. Intelligence, 1, 227

Vesanto J., 2000, IEEE Trans. Neural Networks, 11, 600, 10.1109/72.846731

Bolshakova, N.; Azuaje, F.Cluster validation techniques for genomeexpression data; University of Dublin, Trinity College:  Dublin, 2002; p 13.

Speer N., 2005, Advances in intelligent data analysis VI, 3646, 439, 10.1007/11552253_39

Calinski T., 1974, Comm. Stat., 3, 27

Mitchell, T.Machine Learning; McGraw-Hill:  1997; p 432.

Ryckaert J. P., 1977, J. Comp. Phys., 23, 341, 10.1016/0021-9991(77)90098-5

Berendsen H. J. C., 1984, J. Comp. Phys., 81, 3690

Cornell W. D., 1995, J. Am. Chem. Soc., 117, 5197, 10.1021/ja00124a002

Jorgensen W. L., 1983, J. Chem. Phys., 79, 935, 10.1063/1.445869

Aqvist J, 1990, J. Phys. Chem., 94, 8024, 10.1021/j100384a009

Cheatham T. E., 1998, J. Biomol. Struct. Dyn., 16, 280, 10.1080/07391102.1998.10508245

Wu X. W., 1998, J. Phys. Chem., 102, 7250, 10.1021/jp980839p

Wu X., 2001, J. Phys. Chem. B, 105, 2235

Wu X., 2004, Biophys. J., 86, 1958

Wu X., 2002, J. Am. Chem. Soc., 124, 5283

Pettersen E. F., 2004, J. Comput. Chem., 25, 1612, 10.1002/jcc.20084

Boykin D. W., 1998, J. Med. Chem., 41, 129, 10.1021/jm970570i

Wilson W. D., 1998, J. Am. Chem. Soc., 120, 10321

Mazur S., 2000, J. Mol. Biol., 300, 337, 10.1006/jmbi.2000.3869

Hawkins G. D., 1995, Chem. Phys. Lett., 246, 129, 10.1016/0009-2614(95)01082-K

Tsui V., 2000, J. Am. Chem. Soc., 122, 2498, 10.1021/ja9939385

Wang J., 2006, J. Mol. Graphics Modell., 25, 260, 10.1016/j.jmgm.2005.12.005

Wang J., 2001, J. Comput. Chem., 22, 1228

Bayly C. I., 1993, J. Phys. Chem., 97, 10280, 10.1021/j100142a004

Frisch M. J., 2001, Gaussian 98 (Revision A.10)

Laughton C. A., 1996, Biochemistry, 35, 5661, 10.1021/bi952162r