A hierarchical approach to all‐atom protein loop prediction

Proteins: Structure, Function and Bioinformatics - Tập 55 Số 2 - Trang 351-367 - 2004
Matthew P. Jacobson1, David L. Pincus2, Chaya S. Rapp3, Tyler Day4, Barry Honig5, David E. Shaw4, Richard A. Friesner2
1Department of Pharmaceutical Chemistry, University of California, San Francisco, California#TAB#
2Department of Chemistry, Columbia University, New York, New York
3Department of Chemistry, Stern College, Yeshiva University, New York, New York
4Schrödinger, Inc., New York, New York
5Howard Hughes Medical Institute and Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York

Tóm tắt

AbstractThe application of all‐atom force fields (and explicit or implicit solvent models) to protein homology‐modeling tasks such as side‐chain and loop prediction remains challenging both because of the expense of the individual energy calculations and because of the difficulty of sampling the rugged all‐atom energy surface. Here we address this challenge for the problem of loop prediction through the development of numerous new algorithms, with an emphasis on multiscale and hierarchical techniques. As a first step in evaluating the performance of our loop prediction algorithm, we have applied it to the problem of reconstructing loops in native structures; we also explicitly include crystal packing to provide a fair comparison with crystal structures. In brief, large numbers of loops are generated by using a dihedral angle‐based buildup procedure followed by iterative cycles of clustering, side‐chain optimization, and complete energy minimization of selected loop structures. We evaluate this method by using the largest test set yet used for validation of a loop prediction method, with a total of 833 loops ranging from 4 to 12 residues in length. Average/median backbone root‐mean‐square deviations (RMSDs) to the native structures (superimposing the body of the protein, not the loop itself) are 0.42/0.24 Å for 5 residue loops, 1.00/0.44 Å for 8 residue loops, and 2.47/1.83 Å for 11 residue loops. Median RMSDs are substantially lower than the averages because of a small number of outliers; the causes of these failures are examined in some detail, and many can be attributed to errors in assignment of protonation states of titratable residues, omission of ligands from the simulation, and, in a few cases, probable errors in the experimentally determined structures. When these obvious problems in the data sets are filtered out, average RMSDs to the native structures improve to 0.43 Å for 5 residue loops, 0.84 Å for 8 residue loops, and 1.63 Å for 11 residue loops. In the vast majority of cases, the method locates energy minima that are lower than or equal to that of the minimized native loop, thus indicating that sampling rarely limits prediction accuracy. The overall results are, to our knowledge, the best reported to date, and we attribute this success to the combination of an accurate all‐atom energy function, efficient methods for loop buildup and side‐chain optimization, and, especially for the longer loops, the hierarchical refinement protocol. Proteins 2004;55:000–000. © 2004 Wiley‐Liss, Inc.

Từ khóa


Tài liệu tham khảo

10.1016/S0022-2836(02)00470-9

10.1006/jmbi.1996.0851

10.1006/jmbi.1996.0819

10.1006/jmbi.1999.2826

10.1006/jmbi.1996.0857

10.1016/0263-7855(86)80026-1

Shenkin PS, 1987, Method for quickly generating random conformations of ring‐like structures for subsequent energy minimization or molecular‐dynamics—application to antibody hypervariable loops, Biophys J, 51, A232

10.1073/pnas.102179699

10.1110/ps.9.9.1753

10.1002/1097-0282(2001)60:2<153::AID-BIP1010>3.0.CO;2-6

10.1002/jcc.540110115

10.1002/jcc.540130307

10.1002/prot.1041

10.1002/bip.360260114

10.1002/(SICI)1097-0134(19990501)35:2<173::AID-PROT4>3.0.CO;2-2

10.1002/(SICI)1097-0134(20000701)40:1<135::AID-PROT150>3.0.CO;2-1

10.1002/prot.10285

10.1002/(SICI)1097-0282(199701)41:1<61::AID-BIP6>3.0.CO;2-0

10.1002/(SICI)1096-987X(199906)20:8<819::AID-JCC8>3.0.CO;2-Y

Go N, 1970, Ring closure and local conformational deformations of chain molecules, Biopolymers, 3, 178

10.1021/ma00154a069

10.1002/prot.340180205

10.1002/prot.10235

10.1021/jp021564n

10.1021/ja9621760

10.1021/jp003919d

10.1002/jcc.10045

10.1021/jp982533o

Hartigan JA, 1975, Clustering algorithms

10.2307/2346830

10.1006/jmbi.2001.4865

10.1137/S1052623497313642

10.1002/jcc.540080711

10.1145/146847.146921

10.4310/MRL.1994.v1.n6.a3

Li X, 2003, High resolution prediction of protein helix positions and orientations, Proteins

JacobsonMP PincusDL DayTJF RappCS LiX AnY FriesnerRA.Use of all‐atom physical chemistry energy functions for comparative model construction selection and refinement. Protein Sci2003. Submitted for publication.