Using synthetic peptides to benchmark peptide identification software and search parameters for MS/MS data analysis

EuPA Open Proteomics - Tập 5 - Trang 21-31 - 2014
Andreas Quandt1, Lucia Espona1,2, Akos Balasko3, Hendrik Weisser1, Mi-Youn Brusniak4, Peter Kunszt2, Ruedi Aebersold1,5, Lars Malmström1
1Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Switzerland
2SyBIT, SystemsX.ch, Switzerland
3MTA SZTAKI, Laboratory of Parallel and Distributed Systems, Budapest, Hungary
4Institute for Systems Biology, Seattle, USA
5Faculty of Science, University of Zurich, Switzerland

Tài liệu tham khảo

Nesvizhskii, 2007, Analysis and validation of proteomic data generated by tandem mass spectrometry, Nat Methods, 4, 787, 10.1038/nmeth1088 Matthiesen, 2007, Methods, algorithms and tools in computational proteomics: a practical point of view, Proteomics, 7, 2815, 10.1002/pmic.200700116 Elias, 2007, Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry, Nat Methods, 4, 207, 10.1038/nmeth1019 Benjamini, 1995, Controlling the false discovery rate: a practical and powerful approach to multiple testing, J R Stat Soc B Met, 57, 289 Frank, 2005, PepNovo: de novo peptide sequencing via probabilistic network modeling, Anal Chem, 77, 964, 10.1021/ac048788h Lam, 2007, Development and validation of a spectral library searching method for peptide identification from MS/MS, Proteomics, 7, 655, 10.1002/pmic.200600625 Keller, 2005, A uniform proteomics MS/MS analysis platform utilizing open XML file formats, Mol Syst Biol, 1, 0017, 10.1038/msb4100024 Cottingham, 2005, Manual validation is a hot proteomics topic, Anal Chem, 77, 92, 10.1021/ac053349j Deutsch, 2010, Trans-proteomic pipeline supports and improves analysis of electron transfer dissociation data sets, Proteomics, 10, 1190, 10.1002/pmic.200900567 Kall, 2007, Semi-supervised learning for peptide identification from shotgun proteomics datasets, Nat Methods, 4, 923, 10.1038/nmeth1113 Keller, 2002, Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search, Anal Chem, 74, 5383, 10.1021/ac025747h Tanner, 2005, InsPecT: identification of posttranslationally modified peptides from tandem mass spectra, Anal Chem, 77, 4626, 10.1021/ac050102d Eng, 1994, An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database, J Am Soc Mass Spectrom, 5, 976, 10.1016/1044-0305(94)80016-2 Colinge, 2003, OLAV: towards high-throughput tandem mass spectrometry data identification, Proteomics, 3, 1454, 10.1002/pmic.200300485 Tabb, 2007, MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis, J Proteome Res, 6, 654, 10.1021/pr0604054 Perkins, 1999, Probability-based protein identification by searching sequence databases using mass spectrometry data, Electrophoresis, 20, 3551, 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2 Craig, 2004, TANDEM: matching proteins with tandem mass spectra, Bioinformatics, 20, 1466, 10.1093/bioinformatics/bth092 Geer, 2004, Open mass spectrometry search algorithm, J Proteome Res, 3, 958, 10.1021/pr0499491 Quandt, 2009, SwissPIT: an workflow-based platform for analyzing tandem-MS spectra using the Grid, Proteomics, 9, 2648, 10.1002/pmic.200800207 Shteynberg, 2011, iProphet: multi-level integrative analysis of shotgun proteomic data improves peptide and protein identification rates and error estimates, Mol Cell Proteomics, 10, 10.1074/mcp.M111.007690 Park, 2008, Rapid and accurate peptide identification from tandem mass spectra, J Proteome Res, 7, 3022, 10.1021/pr800127y Nahnsen, 2011, Probabilistic consensus scoring improves tandem mass spectrometry peptide identification, J Proteome Res, 10, 3332, 10.1021/pr2002879 Tabb, 2008, DirecTag: accurate sequence tags from peptide MS/MS through statistical scoring, J Proteome Res, 7, 3838, 10.1021/pr800154p Klimek, 2008, The standard protein mix database: a diverse data set to assist in the production of improved peptide and protein identification software tools, J Proteome Res, 7, 96, 10.1021/pr070244j Ivanov, 2013, Interlaboratory studies and initiatives developing standards for proteomics, Proteomics, 13, 904, 10.1002/pmic.201200532 Marx, 2013, A large synthetic peptide and phosphopeptide reference library for mass spectrometry-based proteomics, Nat Biotechnol, 31, 557, 10.1038/nbt.2585 Picotti, 2007, The implications of proteolytic background for shotgun proteomics, Mol Cell Proteomics, 6, 1589, 10.1074/mcp.M700029-MCP200 Frank, 2002, The SPOT-synthesis technique. Synthetic peptide arrays on membrane supports – principles and applications, J Immunol Methods, 267, 13, 10.1016/S0022-1759(02)00137-0 2009, The Universal Protein Resource (UniProt) 2009, Nucleic Acids Res, 37, D169 Colinge, 2006, InSilicoSpectro: an open-source proteomics library, J Proteome Res, 5, 619, 10.1021/pr0504236 Pedrioli, 2004, A common open representation of mass spectrometry data and its application to proteomics research, Nat Biotechnol, 22, 1459, 10.1038/nbt1031 SyBIT http://www.sybit.net. MySQL http://www.mysql.com. Kessner, 2008, ProteoWizard: open source software for rapid proteomics tools development, Bioinformatics, 24, 2534, 10.1093/bioinformatics/btn323 Weisser, 2013, An automated pipeline for high-throughput label-free quantitative proteomics, J Proteome Res, 12, 1628, 10.1021/pr300992u Farkas, 2011, P-GRADE portal: a generic workflow system to support user communities, Future Gener Comput Syst, 27, 454, 10.1016/j.future.2010.12.001 Picotti, 2010, High-throughput generation of selected reaction-monitoring assays for proteins and proteomes, Nat Methods, 7, 43, 10.1038/nmeth.1408 Sturm, 2008, OpenMS – an open-source software framework for mass spectrometry, BMC Bioinform, 9, 163, 10.1186/1471-2105-9-163 Granholm, 2011, On using samples of known protein content to assess the statistical calibration of scores assigned to peptide-spectrum matches in shotgun proteomics, J Proteome Res, 10, 2671, 10.1021/pr1012619 Vaudel, 2012, A complex standard for protein identification, designed by evolution, J Proteome Res, 11, 5065, 10.1021/pr300055q Beausoleil, 2006, A probability-based approach for high-throughput protein phosphorylation analysis and site localization, Nat Biotechnol, 24, 1285, 10.1038/nbt1240 Ma, 2012, A statistical model-building perspective to identification of MS/MS spectra with PeptideProphet, BMC Bioinform, 13, S1, 10.1186/1471-2105-13-S16-S1 Colaert, 2011, Analysis of the resolution limitations of peptide identification algorithms, J Proteome Res, 10, 5555, 10.1021/pr200913a