Elucidating the ensemble of functionally-relevant transitions in protein systems with a robotics-inspired method

Springer Science and Business Media LLC - Tập 13 - Trang 1-22 - 2013
Kevin Molloy1, Amarda Shehu1,2,3
1Department of Computer Science, George Mason University, Fairfax, USA
2Department of Bioengineering, George Mason University, Fairfax, USA
3School of Systems Biology, George Mason University, Manassas, USA

Tóm tắt

Many proteins tune their biological function by transitioning between different functional states, effectively acting as dynamic molecular machines. Detailed structural characterization of transition trajectories is central to understanding the relationship between protein dynamics and function. Computational approaches that build on the Molecular Dynamics framework are in principle able to model transition trajectories at great detail but also at considerable computational cost. Methods that delay consideration of dynamics and focus instead on elucidating energetically-credible conformational paths connecting two functionally-relevant structures provide a complementary approach. Effective sampling-based path planning methods originating in robotics have been recently proposed to produce conformational paths. These methods largely model short peptides or address large proteins by simplifying conformational space. We propose a robotics-inspired method that connects two given structures of a protein by sampling conformational paths. The method focuses on small- to medium-size proteins, efficiently modeling structural deformations through the use of the molecular fragment replacement technique. In particular, the method grows a tree in conformational space rooted at the start structure, steering the tree to a goal region defined around the goal structure. We investigate various bias schemes over a progress coordinate for balance between coverage of conformational space and progress towards the goal. A geometric projection layer promotes path diversity. A reactive temperature scheme allows sampling of rare paths that cross energy barriers. Experiments are conducted on small- to medium-size proteins of length up to 214 amino acids and with multiple known functionally-relevant states, some of which are more than 13Å apart of each-other. Analysis reveals that the method effectively obtains conformational paths connecting structural states that are significantly different. A detailed analysis on the depth and breadth of the tree suggests that a soft global bias over the progress coordinate enhances sampling and results in higher path diversity. The explicit geometric projection layer that biases the exploration away from over-sampled regions further increases coverage, often improving proximity to the goal by forcing the exploration to find new paths. The reactive temperature scheme is shown effective in increasing path diversity, particularly in difficult structural transitions with known high-energy barriers.

Tài liệu tham khảo

Huang YJ, Montellione GT: Structural biology: proteins flex to function. Nature 2005, 438(7064):36–37. 10.1038/438036a Berman HM, Henrick K, Nakamura H: Announcing the worldwide Protein Data Bank. Nat Struct Biol 2003, 10(12):980–980. 10.1038/nsb1203-980 Shehu A: Probabilistic Search and Optimization for Protein Energy Landscapes. In Handbook of Computational Molecular Biology Edited by: Aluru S, Singh A. 2013. Chapman & Hall/CRC Computer & Information Science Series Majek P, Weinstein H, Elber R: Pathways of conformational transitions in proteins. In Coarse-graining of Condensed Phase and Biomolecular Systems Edited by: Voth GA. 2008, 185–203. Taylor and Francis group van Gunsteren WF, et al.: Biomolecular modeling: goals, problems, perspectives. Angew Chem Int Ed Engl 2006, 45(25):4064–4092. 10.1002/anie.200502655 Hansson T, Oostenbrink C, van Gunsteren WF: Molecular dynamics simulations. Curr Opinion Struct Biol 2002, 12(2):190–196. 10.1016/S0959-440X(02)00308-1 Karplus M, McCammon JA: Molecular dynamics simulations of biomolecules. Nat Struct Biol 2002, 9(9):646–652. 10.1038/nsb0902-646 Karplus M, Kuriyan J: Molecular dynamics and protein function. Proc Natl Acad Sci USA 2005, 102(19):6679–6685. 10.1073/pnas.0408930102 Huang H, Ozkirimli E, Post CB: A comparison of three perturbation molecular dynamics methods for mModeling conformational transitions. J Chem Theory Comput 2009, 5(5):1301–1314. Beckstein O, Denning EJ, Perilla JR, Woolf TB: Zipping and unzipping of adenylate kinase: atomistic insights into the ensemble of open-closed transitions. J Mol Biol 2009, 394: 160–176. 10.1016/j.jmb.2009.09.009 Malek R, Mousseau N: Dynamics of Lennard-Jones clusters: a characterization of the activation-relaxation technique. Phys Rev E 2000, 62(6):7723–7728. 10.1103/PhysRevE.62.7723 Earl DJ, Deem MW: Parallel tempering: theory, applications, and new perspectives. Phys Chem Chem Phys 2005, 7: 3910–3916. 10.1039/b509983h Arora K, Brooks CLI: Large-scale allosteric conformational transitions of adenylate kinase appear to involve a population-shift mechanism. Proc Natl Acad Sci USA 2007, 104(47):18496–18501. 10.1073/pnas.0706443104 Zhang Y, Kihara D, Skolnick J: Local energy landscape flattening: parallel hyperbolic Monte Carlo sampling of protein folding. Proteins 2002, 48(2):192–201. 10.1002/prot.10141 Schulze BG, Grubmueller H, Evanseck JD: Functional significance of hierarchical tiers in carbonmonoxy myoglobin: conformational substates and transitions studied by conformational flooding simulations. J Am Chem Soc 2000, 122(36):8700–8711. 10.1021/ja993788y Krueger P, Verheyden S, Declerck PJ, Engelborghs Y: Extending the capabilities of targeted molecular dynamics: simulation of a large conformational transition in plasminogen activator inhibitor 1. Protein Sci 2001, 10(4):798–808. 10.1110/ps.40401 Schlitter J, Engels M, Krüger P: Targeted molecular dynamics - a new approach for searching pathways of conformational transitions. J Mol Graph 1994, 12(2):84–89. 10.1016/0263-7855(94)80072-3 Mashi RJ, Jakobsson E: End-point targeted molecular dynamics: large-scale conformational changes in potassium channels. Biophys J 2008, 94(11):4307–4319. 10.1529/biophysj.107.118778 van der Vaart A, Karplus M: Minimum free energy pathways and free energy profiles for conformational transitions based on atomistic molecular dynamics simulations. J Chem Phys 2007, 126: 164106. 10.1063/1.2719697 Pan AC, Sezer D, Roux B: Finding transition pathways using the string method with swarms of trajectories. J Phys Chem B 2008, 112(11):3432–3440. 10.1021/jp0777059 Zhang BW, Jasnow D, Zuckermann DM: Efficient and verified simulation of a path ensemble for conformational change in a united-residue model of calmodulin. Proc Natl Acad Sci USA 2007, 104(46):18043–18048. 10.1073/pnas.0706349104 Kim KM, Jernigan RL, Chirikjian GS: Efficient generation of feasible pathways for protein conformationa transitions. Biophys J 2002, 83(3):1620–1630. 10.1016/S0006-3495(02)73931-3 Schuyler AD, Jernigan RL, Qasba PK, Ramakrishnan B, Chirikjian GS: Iterative cluster-NMA: A tool for generating conformational transitions in proteins. Proteins 2009, 74(3):760–776. 10.1002/prot.22200 Zheng W, Brooks B: Identification of dynamical correlations within the myosin motor domain by the normal mode analysis of an elastic network model. J Mol Biol 2005, 346(3):745–759. 10.1016/j.jmb.2004.12.020 Bahar R, Rader AJ: Coarse-grained normal mode analysis in structural biology. Curr Opin Struct Biol 2005, 15: 586–592. 10.1016/j.sbi.2005.08.007 Kantarci-Carsibasi N, Haliloglu T, Doruker P: Conformational transition pathways explored by Monte Carlo simulation integrated with collective modes. Biophys J 2008, 95(12):5862–5873. 10.1529/biophysj.107.128447 Korkut A, Hendrickson WA: Computation of conformational transtions in proteins by virtual atom molecular mechanics as validated in application to adenylate kinase. Proc Natl Acad Sci USA 2009, 106(37):15673–15678. 10.1073/pnas.0907684106 Teknipar M, Zheng W: Predicting order of conformational changes during protein conformational transitions using an interpolated elastic network model. Proteins 2010, 78(11):2469–2481. Kirillova S, Cortés J, Stefaniu A, Siméon T: An NMA-guided path planning approach for computing large-amplitude conformational changes in proteins. Proteins 2008, 70: 131–143. Lou H, Wang RI: Molecular dynamics of apo-adenylate kinase: a distance replica exchange method for the free energy of conformational fluctuations. J Phys Chem B 2006, 110(47):24121–24137. 10.1021/jp064303c Kuniztki MB, de Groot BL: The atomistic mechanism of conformational transition in adenylate kinase: a TEE-REX molecular dynamics study. Structure 2008, 16(8):1175–1182. 10.1016/j.str.2008.04.013 Chu JW, Trout BL, Brooks CLI: A super-LINEAR minimization scheme for the nudged elastic band method. J Chem Phys 2003, 119: 12708–12717. 10.1063/1.1627754 Maragliano L, Fischer A, Vanden-Eijnden EJ, Ciccotti G: String method in collective variables: minimum free energy paths and isocommittor surfaces. J Chem Phys 2006, 125: 24106. 10.1063/1.2212942 Weinan E, Ren W, Vanden-Eijnden E: Simplified and improved string method for computing the minimum energy paths in barrier-crossing events. J Chem Phys 2007, 126: 164103. 10.1063/1.2720838 Maragliano L, Vanden-Eijnden E: On-the-fly string method for minimum free energy paths calculation. Chem Phys Lett 2007, 446: 182–190. 10.1016/j.cplett.2007.08.017 Weinan E, Ren W, Vanden-Eijnden E: Finite temperature string methods for the study of rare events. J Phys Chem B 2005, 109: 6688–6693. 10.1021/jp0455430 Ren W, Vanden-Eijnden E, Maragakis P, Weinan E: Transition pathways in complex systems: application of the finite-temperature string method to the alanine dipeptide. J Chem Phys 2005, 123: 134109. 10.1063/1.2013256 Weiss DR, Levitt M: Can morphing methods predict intermediate structures? J Mol Biol 2009, 385(2):665–674. 10.1016/j.jmb.2008.10.064 Kavraki LE, Svetska P, Latombe JC, Overmars M: Probabilistic roadmaps for path planning in high-dimensional configuration spaces. IEEE Trans Robot Autom 1996, 12(4):566–580. 10.1109/70.508439 Choset H, et al.: Principles of Robot Motion: Theory, Algorithms, and Implementations. 1st edition. Cambridge, MA: MIT Press; 2005. LaValle SM, Kuffner JJ: Randomized kinodynamic planning. Int J Robot Res 2001, 20(5):378–400. 10.1177/02783640122067453 Hsu D, Kindel R, Latombe JC, Rock S: Randomized kinodynamic motion planning with moving obstacles. Int J Robot Res 2002, 21(3):233–255. 10.1177/027836402320556421 Ladd AM, Kavraki LE: Motion planning in the presence of drift, underactuation and discrete system changes. In Robotics: Sci and Syst. 2005; 233–241. Boston, MA Song G, Amato NM: A motion planning approach to folding: from paper craft to protein folding. IEEE Trans Robot Autom 2004, 20: 60–71. 10.1109/TRA.2003.820926 Chiang TH, Apaydin MS, Brutlag DL, Hsu D, Latombe JC: Using stochastic roadmap simulation to predict experimental quantities in protein folding kinetics: folding rates and phi-values. J Comput Biol 2007, 14(5):578–593. 10.1089/cmb.2007.R004 Cortes J, Simeon T, de Angulo R, Guieysse D, Remaud-Simeon M, Tran V: A path planning approach for computing large-amplitude motions of flexible molecules. Bioinformatics 2005, 21(S1):116–125. Shehu A: An ab-initio tree-based exploration to enhance sampling of low-energy protein conformations. In Robot: Sci and Sys. Seattle, WA, USA; 2009:241–248. Shehu A, Olson B: Guiding the search for native-like protein conformations with an ab-initio tree-based exploration. Int J Robot Res 2010, 29(8):1106–11227. 10.1177/0278364910371527 Olson B, Molloy K, Shehu A: In search of the protein native state with a probabilistic sampling approach. J Bioinform Comput Biol 2011, 9(3):383–398. 10.1142/S0219720011005574 Jaillet L, Corcho FJ, Perez JJ, Cortes J: Randomized tree construction algorithm to explore energy landscapes. J Comput Chem 2011, 32(16):3464–3474. 10.1002/jcc.21931 Haspel N, Moll M, Baker ML, Chiu W, E KL: Tracing conformational changes in proteins. BMC Struct Biol 2010, 10(Suppl1):S1. Shehu A: Conformational Search for the Protein Native State. In Protein Structure Prediction: Method and Algorithms. Edited by: Rangwala H, Karypis G, Fairfax VA. Wiley Book Series on Bioinformatics; 2010. Olson B, Molloy K, Hendi SF, Shehu A: Guiding probabilistic search of the protein conformational space with structural profiles. J Bioinform Comput Biol 2012, 10(3):1242005. 10.1142/S021972001242005X Olson B, Shehu A: Evolutionary-inspired probabilistic search for enhancing sampling of local minima in the protein energy surface. Proteome Sci 2012, 10(Suppl 1):S5. 10.1186/1477-5956-10-S1-S5 Amato NM, Dill KA, Song G: Using motion planning to map protein folding landscapes and analyze folding kinetics of known native structures. J Comput Biol 2003, 10(3–4):239–255. 10.1089/10665270360688002 Molloy K, Shehu A: A robotics-inspired method to sample conformational paths connecting known functionally-relevant structures in protein systems. Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on: 4–7 October 2012 2012, 56–63. 10.1109/BIBMW.2012.6470380 McLachlan AD: A mathematical procedure for superimposing atomic coordinates of proteins. Acta Crystallogr A 1972, 26(6):656–657. Maiorov VN, Crippen GM: Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins. J Mol Biol 1994, 235(2):625–634. 10.1006/jmbi.1994.1017 Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M: CHARMM: a program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem 1983, 4(2):187–217. 10.1002/jcc.540040211 Zhang M, Kavraki LE: A new method for fast and accurate derivation of molecular conformations. J Chem Inf Comput Sci 2002, 42: 64–70. 10.1021/ci010327z Xu J: Rapid protein side-chain packing via tree decomposition.In Research in Computational Molecular Biology, Volume 3500 of Lecture Notes in Computer Science Edited by: Miyano S, Mesirov J, Kasif S, Istrail S, Pevzner P, Waterman M. Springer Berlin Heidelberg; 2005, 423–439. [http://dx.doi.org/10.1007/11415770_32] Krivov GG, Shapovalov MV, Dunbrack RL Jr.: Improved prediction of protein side-chain conformations with SCWRL4. Proteins 2009, 77(4):778–795. 10.1002/prot.22488 Rohl CA, Strauss CE, Misura KM, Baker D: Protein structure prediction using Rosetta. Methods Enzymol 2004, 383: 66–93. Papoian GA, Ulander J, Eastwood MP, Luthey-Schulten Z, Wolynes PG: Water in protein structure prediction. Proc Natl Acad Sci USA 2004, 101(10):3352–3357. 10.1073/pnas.0307851100 Prentiss MC, Wales DJ, Wolynes PG: Protein structure prediction using basin-hopping. The Journal of Chemical Physics 2008, 128(22):225106–225106. 10.1063/1.2929833 Hegler JA, Laetzer J, Shehu A, Clementi C, Wolynes PG: Restriction vs. guidance: fragment assembly and associative memory hamiltonians for protein structure prediction. Proc Natl Acad Sci USA 2009, 106(36):15302–15307. 10.1073/pnas.0907002106 Prentiss MC, Hardin C, Eastwood MP, Zong C, Wolynes PG: Protein structure prediction: the next generation. J Chem Theory Comput 2006, 2(3):705–716. 10.1021/ct0600058 Shehu A, Kavraki LE, Clementi C: Multiscale characterization of protein conformational ensembles. Proteins 2009, 76(4):837–851. 10.1002/prot.22390 Case DA, et al.: AMBER 9. University of California, San Francisco; 2006. Bonneau R, Baker D: De novo prediction of three-dimensional structures for major protein families. J Mol Biol 2002, 322: 65–78. 10.1016/S0022-2836(02)00698-8 Bradley P, Misura KMS, Baker D: Toward high-resolution de novo structure prediction for small proteins. Science 2005, 309(5742):1868–1871. 10.1126/science.1113801 Brunette TJ, Brock O: Guiding conformation space search with an all-atom energy potential. Proteins 2008, 73(4):958–972. 10.1002/prot.22123 DeBartolo J, Colubri A, Jha AK, Fitzgerald JE, Freed KF, Sosnick TR: Mimicking the folding pathway to improve homology-free protein structure prediction. Proc Natl Acad Sci USA 2009, 106(10):3734–3739. 10.1073/pnas.0811363106 Ding F, Tsao D, Nie H, Dokholyan NV: Ab initio folding of proteins with all-atom discrete molecular dynamics. Structure 2008, 16(7):1010–1018. 10.1016/j.str.2008.03.013 Project E, Friedman R, Nachliel E, Gutman M: A molecular dynamics study of the effect of Ca2+removal on calmodulin structure. Biophys J 2006, 90(11):3842–3850. 10.1529/biophysj.105.077792 Finn BE, Evenäas J, Drakenberg T, Waltho JP, Thulin E, Forséen S: Calcium-induced structural changes and domain autonomy in calmodulin. Nat Struct Biol 1995, 2(9):777–783. 10.1038/nsb0995-777 Evenäas J, Forséen S, Malmendal A, Akke M: Backbone dynamics and energetics of a calmodulin domain mutant exchanging between closed and open conformations. J Mol Biol 1999, 289(3):603–617. 10.1006/jmbi.1999.2770 Kumar S, Rosenberg JM, Bouzida D, Swendsen RH, Kollman PA: The weighted histogram analysis method for free-energy calculations on biomolecules: I. The method. J Comput Chem 1993, 13(8):1011–1021. Ravindranathan KP, Gallicchio E, Levy RM: Conformational equilibria and free energy profiles for the allosteric transition of the ribose-binding protein. J Mol Biol 2005, 353: 196–210. 10.1016/j.jmb.2005.08.009 Arnold K, Bordoli L, Kopp J, Schwede T: The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics 2006, 22(2):195–201. 10.1093/bioinformatics/bti770 Boehr DD, Wright PE: How do proteins interact? Science 2008, 320(5882):1429–1430. 10.1126/science.1158818 Boehr DD, Nussinov R, Wright PE: The role of dynamic conformational ensembles in biomolecular recognition. Nat Chem Biol 2009, 5(11):789–96. 10.1038/nchembio.232 Jaillet L, Cortes J, Simeon T: Transition-based RRT for path planning in continuous cost spaces. In IEEE/RSJ Int Conf Intel Rob Sys. Stanford, CA: AAAI; 2008:22–26.