Exploring the variable space of shallow machine learning models for reversed-phase retention time prediction

Computational and Structural Biotechnology Journal - Tập 21 - Trang 2446-2453 - 2023
Darien Yeung1,2, Victor Spicer2, René P. Zahedi1,2,3,4, Oleg Krokhin1,2,3
1Department of Biochemistry and Medical Genetics, University of Manitoba, 336 BMSB, 745 Bannatyne Avenue, Winnipeg R3E 0J9, Canada
2Manitoba Centre for Proteomics and Systems Biology, University of Manitoba, 799 JBRC, 715 McDermot Avenue, Winnipeg R3E 3P4, Canada
3Department of Internal Medicine, University of Manitoba, 799 JBRC, 715 McDermot Avenue, Winnipeg R3E 3P4, Canada
4CancerCare Manitoba Research Institute, 675 McDermot Avenue, Winnipeg R3E 0V9, Canada

Tài liệu tham khảo

Wimley, 1996, Experimentally determined hydrophobicity scale for proteins at membrane interfaces, Nat Struct Biol, 3, 842, 10.1038/nsb1096-842

Janin, 1979, Surface and inside volumes in globular proteins, Nature, 277, 491, 10.1038/277491a0

Kyte, 1982, A simple method for displaying the hydropathic character of a protein, J Mol Biol, 157, 105, 10.1016/0022-2836(82)90515-0

Meek, 1980, Prediction of peptide retention times in high-pressure liquid chromatography on the basis of amino acid composition, Proc Natl Acad Sci USA, 77, 1632, 10.1073/pnas.77.3.1632

Guo, 1986, Prediction of peptide retention times in reversed-phase high-performance liquid chromatography. I. Determination of retention coefficients of amino acid residues of model synthetic peptides, J Chromatogr, 359, 499, 10.1016/0021-9673(86)80102-9

Meek, 1981, Prediction of peptide retention times in high-pressure liquid chromatography on the basis of amino acid composition, J Chromatogr, 211, 15, 10.1016/S0021-9673(00)81169-3

Wilce, 1995, Physicochemical basis of amino acid hydrophobicity scales: evaluation of four new scales of amino acid hydrophobicity coefficients derived from RP-HPLC of peptides, Anal Chem, 67, 1210, 10.1021/ac00103a012

Houghten, 1987, Effect of positional environmental domains on the variation of high-performance liquid chromatographic peptide retention coefficients, J Chromatogr, 386, 223, 10.1016/S0021-9673(01)94599-6

Tripet, 2007, Requirements for prediction of peptide retention time in reversed-phase high-performance liquid chromatography: hydrophilicity/hydrophobicity of side-chains at the N- and C-termini of peptides are dramatically affected by the end-groups and location, J Chromatogr A, 1141, 212, 10.1016/j.chroma.2006.12.024

Spicer, 2014, N-capping motifs promote interaction of amphipathic helical peptides with hydrophobic surfaces and drastically alter hydrophobicity values of individual amino acids, Anal Chem, 86, 11498, 10.1021/ac503352h

Kovacs, 2006, Determination of intrinsic hydrophilicity/hydrophobicity of amino acid side chains in peptides in the absence of nearest-neighbor or conformational effects, Biopolymers, 84, 283, 10.1002/bip.20417

Krokhin, 2006, Sequence-specific retention calculator. Algorithm for peptide retention prediction in ion-pair RP-HPLC: application to 300- and 100-A pore size C18 sorbents, Anal Chem, 78, 7785, 10.1021/ac060777w

Palmblad, 2002, Prediction of chromatographic retention and protein identification in liquid chromatography/mass spectrometry, Anal Chem, 74, 5826, 10.1021/ac0256890

Klammer, 2007, Improving tandem mass spectrum identification using peptide retention time prediction across diverse chromatography conditions, Anal Chem, 79, 6111, 10.1021/ac070262k

Petritis, 2003, Use of artificial neural networks for the accurate prediction of peptide liquid chromatography elution times in proteome analyses, Anal Chem, 75, 1039, 10.1021/ac0205154

Shinoda, 2006, Prediction of liquid chromatographic retention times of peptides generated by protease digestion of the Escherichia coli proteome using artificial neural networks, J Proteome Res, 5, 3312, 10.1021/pr0602038

Gilar, 2010, Utility of retention prediction model for investigation of peptide separation selectivity in reversed-phase liquid chromatography: impact of concentration of trifluoroacetic acid, column temperature, gradient slope and type of stationary phase, Anal Chem, 82, 265, 10.1021/ac901931c

Krokhin, 2004, An improved model for prediction of retention times of tryptic peptides in ion pair reversed-phase HPLC: its application to protein peptide mapping by off-line HPLC-MALDI MS, Mol Cell Proteom, 3, 908, 10.1074/mcp.M400031-MCP200

Ma, 2018, Improved peptide retention time prediction in liquid chromatography through deep learning, Anal Chem, 90, 10881, 10.1021/acs.analchem.8b02386

Gessulat, 2019, Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning, Nat Methods, 16, 509, 10.1038/s41592-019-0426-7

Muller, 2020, The proteome landscape of the kingdoms of life, Nature, 582, 592, 10.1038/s41586-020-2402-x

Bouwmeester, 2021, DeepLC can predict retention times for peptides that carry as-yet unseen modifications, Nat Methods, 18, 1363, 10.1038/s41592-021-01301-5

Spicer, 2016, 3D HPLC-MS with reversed-phase separation functionality in all three dimensions for large-scale bottom-up proteomics and peptide retention data collection, Anal Chem, 88, 2847, 10.1021/acs.analchem.5b04567

Krokhin, 2009, Peptide retention standards and hydrophobicity indexes in reversed-phase high-performance liquid chromatography of peptides, Anal Chem, 81, 9522, 10.1021/ac9016693

Yeung, 2020, Separation orthogonality in liquid chromatography-mass spectrometry for proteomic applications: comparison of 16 different two-dimensional combinations, Anal Chem, 92, 3904, 10.1021/acs.analchem.9b05407

Dwivedi, 2008, Practical implementation of 2D HPLC scheme with accurate peptide retention prediction in both dimensions for high-throughput bottom-up proteomics, Anal Chem, 80, 7036, 10.1021/ac800984n