Understanding the effectiveness of enzyme pre-reaction state by a quantum-based machine learning model

Cell Reports Physical Science - Tập 3 - Trang 101128 - 2022
Shenggan Luo1, Lanxuan Liu2, Chu-Jun Lyu1, Byuri Sim1, Yihan Liu1, Haifan Gong2, Yao Nie3, Yi-Lei Zhao1
1State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
2Shanghai Artificial Intelligence Laboratory, Shanghai, 200232, China
3School of Biotechnology and Key Laboratory of Industrial Biotechnology Ministry of Education, Jiangnan University, Wuxi 214122, China

Tài liệu tham khảo

Mengel, 1999, Around and beyond cram's rule, Chem. Rev., 99, 1191, 10.1021/cr980379w Qin, 2018, Discovery of a switch between prelog and anti-prelog reduction toward halogen-substituted acetophenones in short-chain dehydrogenase/reductases, ACS Catal., 8, 6012, 10.1021/acscatal.8b00807 Nie, 2018, Enzyme engineering based on X-ray structures and kinetic profiling of substrate libraries: alcohol dehydrogenases for stereospecific synthesis of a broad range of chiral alcohols, ACS Catal., 8, 5145, 10.1021/acscatal.8b00364 Poree, 2017, A holy grail in chemistry: computational catalyst design: feasible or fiction?, Acc. Chem. Res., 50, 605, 10.1021/acs.accounts.6b00606 Peng, 2016, Computing organic stereoselectivity - from concepts to quantitative calculations and predictions, Chem. Soc. Rev., 45, 6093, 10.1039/C6CS00573J Zhang, 2003, Influence of structural fluctuation on enzyme reaction energy barriers in combined quantum mechanical/molecular mechanical studies, J. Phys. Chem. B, 107, 4459, 10.1021/jp022525e von der Esch, 2019, Finding reactive configurations: a machine learning approach for estimating energy barriers applied to Sirtuin 5, J. Chem. Theor. Comput., 15, 6660, 10.1021/acs.jctc.9b00876 Smith, 2019, Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning, Nat. Commun., 10, 2903, 10.1038/s41467-019-10827-4 St John, 2020, Prediction of organic homolytic bond dissociation enthalpies at near chemical accuracy with sub-second computational cost, Nat. Commun., 11, 2328, 10.1038/s41467-020-16201-z Lemm, 2021, Machine learning based energy-free structure predictions of molecules, transition states, and solids, Nat. Commun., 12, 4468, 10.1038/s41467-021-24525-7 Vamathevan, 2019, Applications of machine learning in drug discovery and development, Nat. Rev. Drug Discov., 18, 463, 10.1038/s41573-019-0024-5 Manzhos, 2021, Neural network potential energy surfaces for small molecules and reactions, Chem. Rev., 121, 10187, 10.1021/acs.chemrev.0c00665 Wang, 2020, Simulation and design of energy materials accelerated by machine learning, WIREs Comput. Mol. Sci., 10, 1, 10.1002/wcms.1421 Sigman, 2016, The development of multidimensional analysis tools for asymmetric catalysis and beyond, Acc. Chem. Res., 49, 1292, 10.1021/acs.accounts.6b00194 Reid, 2019, Holistic prediction of enantioselectivity in asymmetric catalysis, Nature, 571, 343, 10.1038/s41586-019-1384-z Reid, 2019, Predictive multivariate linear regression analysis guides successful catalytic enantioselective minisci reactions of diazines, J. Am. Chem. Soc., 141, 19178, 10.1021/jacs.9b11658 Zahrt, 2019, Prediction of higher-selectivity catalysts by computer-driven workflow and machine learning, Science, 363, 10.1126/science.aau5631 Henle, 2020, Development of a computer-guided workflow for catalyst optimization. Descriptor validation, subset selection, and training set analysis, J. Am. Chem. Soc., 142, 11578, 10.1021/jacs.0c04715 Jorner, 2021, Machine learning meets mechanistic modelling for accurate prediction of experimental activation energies, Chem. Sci., 12, 1163, 10.1039/D0SC04896H Li, 2020, Predicting regioselectivity in radical C-H functionalization of heterocycles through machine learning, Angew Chem. Int. Ed. Engl., 59, 13253, 10.1002/anie.202000959 Yang, 2021, Machine learning prediction of hydrogen atom transfer reactivity in photoredox-mediated C–H functionalization, Org. Chem. Front., 8, 6187, 10.1039/D1QO01325D Singh, 2019, Predicting chemical reaction barriers with a machine learning model, Catal. Lett., 149, 2347, 10.1007/s10562-019-02705-x Choi, 2018, Feasibility of activation energy prediction of gas-phase reactions by machine learning, Chemistry, 24, 12354, 10.1002/chem.201800345 von Lilienfeld, 2020, Exploring chemical compound space with quantum-based machine learning, Nat. Rev. Chem, 4, 347, 10.1038/s41570-020-0189-9 von Lilienfeld, 2020, Retrospective on a decade of machine learning for chemical discovery, Nat. Commun., 11, 4895, 10.1038/s41467-020-18556-9 Heinen, 2021, Toward the design of chemical reactions: machine learning barriers of competing mechanisms in reactant space, J. Chem. Phys., 155, 064105, 10.1063/5.0059742 Lu, 2021, Structure-based relative energy prediction model: a case study of Pd(II)-Catalyzed ethylene polymerization and the electronic effect of ancillary ligands, J. Phys. Chem. B, 125, 12047, 10.1021/acs.jpcb.1c05143 Rupp, 2012, Fast and accurate modeling of molecular atomization energies with machine learning, Phys. Rev. Lett., 108, 058301, 10.1103/PhysRevLett.108.058301 Lewis-Atwell, 2022, Machine learning activation energies of chemical reactions, WIREs Comput. Mol. Sci., 12, e1593, 10.1002/wcms.1593 Maley, 2020, Quantum-mechanical transition-state model combined with machine learning provides catalyst design features for selective Cr olefin oligomerization, Chem. Sci., 11, 9665, 10.1039/D0SC03552A Friederich, 2020, Machine learning dihydrogen activation in the chemical space surrounding Vaska's complex, Chem. Sci., 11, 4584, 10.1039/D0SC00445F Pronobis, 2018, Many-body descriptors for predicting molecular properties with machine learning: analysis of pairwise and three-body interactions in molecules, J. Chem. Theor. Comput., 14, 2991, 10.1021/acs.jctc.8b00110 Gallarati, 2021, Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts, Chem. Sci., 12, 6879, 10.1039/D1SC00482D Song, 2020, Unraveling the energetic significance of chemical events in enzyme catalysis via machine-learning based regression approach, Commun. Chem., 3, 134, 10.1038/s42004-020-00379-w Nandi, 2013, Quantitative structure-activation barrier relationship modeling for Diels-Alder ligations utilizing quantum chemical structural descriptors, Chem. Cent. J., 7, 171, 10.1186/1752-153X-7-171 Clements, 2021, Workflow for biocatalytic reaction performance prediction and analysis, Preprint at ChemRxiv Vargas, 2021, Machine learning to predict diels-alder reaction barriers from the reactant state electron density, J. Chem. Theor. Comput., 17, 6203, 10.1021/acs.jctc.1c00623 Giraldo, 2006, The catalytic power of enzymes: conformational selection or transition state stabilization?, FEBS Lett., 580, 2170, 10.1016/j.febslet.2006.03.060 Ryde, 2017, How many conformations need to Be sampled to obtain converged QM/MM energies? The curse of exponential averaging, J. Chem. Theor. Comput., 13, 5745, 10.1021/acs.jctc.7b00826 Chen, 2016, Theoretical studies on the mechanism of thioesterase-catalyzed macrocyclization in erythromycin biosynthesis, ACS Catal., 6, 4369, 10.1021/acscatal.6b01154 Shi, 2018, Theoretical studies on the catalytic mechanism and substrate diversity for macrocyclization of pikromycin thioesterase, ACS Catal., 8, 4323, 10.1021/acscatal.8b01156 He, 2019, Enzymatic pyran formation involved in xiamenmycin biosynthesis, ACS Catal., 9, 5391, 10.1021/acscatal.9b01034 Liu, 2020, Structure-guided engineering of d-carbamoylase reveals a key loop at substrate entrance tunnel, ACS Catal., 10, 12393, 10.1021/acscatal.0c02942 Jiang, 2021, Computational exploration of how enzyme XimE converts natural S-epoxide to pyran and R-epoxide to furan, ACS Catal., 11, 7928, 10.1021/acscatal.1c01335 Bornscheuer, 2012, Engineering the third wave of biocatalysis, Nature, 485, 185, 10.1038/nature11117 Noey, 2015, Origins of stereoselectivity in evolved ketoreductases, Proc. Natl. Acad. Sci. USA, 112, E7065, 10.1073/pnas.1507910112 Zhou, 2018, Structural insight into enantioselective inversion of an alcohol dehydrogenase reveals a "polar gate" in stereorecognition of diaryl ketones, J. Am. Chem. Soc., 140, 12645, 10.1021/jacs.8b08640 Li, 2021, Single-point mutant inverts the stereoselectivity of a carbonyl reductase toward beta-ketoesters with enhanced activity, Chemistry, 27, 6283, 10.1002/chem.202005195 Wu, 2022, Computer-aided understanding and engineering of enzymatic selectivity, Biotechnol. Adv., 54, 107793, 10.1016/j.biotechadv.2021.107793 Gu, 2021, Evolutionary coupling-inspired engineering of alcohol dehydrogenase reveals the influence of distant sites on its catalytic efficiency for stereospecific synthesis of chiral alcohols, Comput. Struct. Biotechnol. J., 19, 5864, 10.1016/j.csbj.2021.10.031 Sim, 2022, Origin of the unexpected enantioselectivity in the enzymatic reductions of 5-membered-ring heterocyclic ketones catalyzed by Candida parasilosis carbonyl reductases, Catalysts, 12, 1086, 10.3390/catal12101086 Gregorutti, 2017, Correlation and variable importance in random forests, Stat. Comput., 27, 659, 10.1007/s11222-016-9646-1 Yang, 2022, Machine learning prediction of structure-performance relationship in organic synthesis, Chin. J. Chem., 40, 2106, 10.1002/cjoc.202200039 Anandakrishnan, 2012, H++ 3.0: automating pK prediction and the preparation of biomolecular structures for atomistic molecular modeling and simulations, Nucleic Acids Res., 40, W537, 10.1093/nar/gks375 Peters, 2010, Structural survey of zinc-containing proteins and development of the zinc AMBER force field (ZAFF), J. Chem. Theor. Comput., 6, 2935, 10.1021/ct1002626 Ryde, 1995, Molecular dynamics simulations of alcohol dehydrogenase with a four- or five-coordinate catalytic zinc ion, Proteins, 21, 40, 10.1002/prot.340210106 Li, 2016, MCPB.py: a Python based metal center parameter builder, J. Chem. Inf. Model., 56, 599, 10.1021/acs.jcim.5b00674 Maier, 2015, ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB, J. Chem. Theor. Comput., 11, 3696, 10.1021/acs.jctc.5b00255 Case, 2005, The Amber biomolecular simulation programs, J. Comput. Chem., 26, 1668, 10.1002/jcc.20290 Frisch, 2016 Zhao, 2008, Theor. Chem. Acc., 120, 215, 10.1007/s00214-007-0310-x Francl, 1982, Self-consistent molecular orbital methods. XXIII. A polarization-type basis set for second-row elements, J. Chem. Phys., 77, 3654, 10.1063/1.444267 Cossi, 2003, Energies, structures, and electronic properties of molecules in solution with the C-PCM solvation model, J. Comput. Chem., 24, 669, 10.1002/jcc.10189 Hratchian, 2005, Using hessian updating to increase the efficiency of a hessian based predictor-corrector reaction path following method, J. Chem. Theor. Comput., 1, 61, 10.1021/ct0499783 Lu, 2012, Multiwfn: a multifunctional wavefunction analyzer, J. Comput. Chem., 33, 580, 10.1002/jcc.22885 Humphrey, 1996, VMD: visual molecular dynamics, J. Mol. Graph., 14, 33, 10.1016/0263-7855(96)00018-5 Torrie, 1977, Nonphysical sampling distributions in Monte Carlo free-energy estimation: umbrella sampling, J. Comput. Phys., 23, 187, 10.1016/0021-9991(77)90121-8 Kästner, 2011, Umbrella sampling, Wiley Interdiscip. Rev. Comput. Mol. Sci., 1, 932, 10.1002/wcms.66 Kumar, 1992, THE weighted histogram analysis method for free-energy calculations on biomolecules. I. The method, J. Comput. Chem., 13, 1011, 10.1002/jcc.540130812 Tibshirani, 1996, Regression shrinkage and selection via the Lasso, J. Roy. Stat. Soc. B, 58, 267 Friedman, 2010, Regularization paths for generalized linear models via coordinate descent, J. Stat. Software, 33, 1 Chang, 2011, Libsvm: a library for support vector machines, ACM Trans. Intell. Syst. Technol., 2, 1, 10.1145/1961189.1961199 Vapnik, 2000