Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures

Proteomics - Tập 9 Số 6 - Trang 1696-1719 - 2009
Guozhong Li1, Johannes P.C. Vissers2, Jeffrey C. Silva3,1, Dan Golick1, M. V. Gorenstein1, Scott Geromanos1
1Waters Corporation, Milford, MA, USA
2Waters Corporation, Manchester, UK
3Current address: Cell Signaling Technology, Inc., 3 Trask Lane, Danvers, MA 01923, USA

Tóm tắt

Abstract

A novel database search algorithm is presented for the qualitative identification of proteins over a wide dynamic range, both in simple and complex biological samples. The algorithm has been designed for the analysis of data originating from data independent acquisitions, whereby multiple precursor ions are fragmented simultaneously. Measurements used by the algorithm include retention time, ion intensities, charge state, and accurate masses on both precursor and product ions from LC‐MS data. The search algorithm uses an iterative process whereby each iteration incrementally increases the selectivity, specificity, and sensitivity of the overall strategy. Increased specificity is obtained by utilizing a subset database search approach, whereby for each subsequent stage of the search, only those peptides from securely identified proteins are queried. Tentative peptide and protein identifications are ranked and scored by their relative correlation to a number of models of known and empirically derived physicochemical attributes of proteins and peptides. In addition, the algorithm utilizes decoy database techniques for automatically determining the false positive identification rates. The search algorithm has been tested by comparing the search results from a four‐protein mixture, the same four‐protein mixture spiked into a complex biological background, and a variety of other “system” type protein digest mixtures. The method was validated independently by data dependent methods, while concurrently relying on replication and selectivity. Comparisons were also performed with other commercially and publicly available peptide fragmentation search algorithms. The presented results demonstrate the ability to correctly identify peptides and proteins from data independent acquisition strategies with high sensitivity and specificity. They also illustrate a more comprehensive analysis of the samples studied; providing approximately 20% more protein identifications, compared to a more conventional data directed approach using the same identification criteria, with a concurrent increase in both sequence coverage and the number of modified peptides.

Từ khóa


Tài liệu tham khảo

10.1073/pnas.90.11.5011

10.1006/bbrc.1993.2009

10.1002/bms.1200220605

10.1016/0960-9822(93)90195-T

10.1006/abio.1993.1514

10.1021/ac00076a008

10.1073/pnas.93.16.8264

10.1021/ac00096a002

10.1073/pnas.93.25.14440

10.1038/85686

Delahunty C., 2005, Protein identification using 2D‐LC‐MS/MS, Mass Spectrom. Proteomics, 35, 209

10.1021/ar00047a008

10.1021/cr990076h

10.1038/nature01511

10.1016/j.bbapap.2006.10.003

10.1007/s10529-006-9065-z

10.1002/pmic.200800562

10.1016/1044-0305(94)80016-2

10.1021/pr049882h

10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2

10.1002/cfg.370

10.1021/ac025826t

10.1038/nbt0303-262

Beynon R. J., 2001, Proteolytic Enzymes: A Practical Approach, 149, 10.1093/oso/9780199636631.001.0001

10.1074/mcp.T400003-MCP200

10.1038/nmeth1019

10.1021/pr070492f

Fenyo D. Ossipova E. Eriksson J. On the peptide fragment mass information required to identify peptides. HUPO 7th Annual World Congress Amsterdam2008 poster P‐TUE‐147.

10.1089/153623102760092805

10.1021/pr015514r

10.1016/S1044-0305(00)00097-0

10.1021/ac026424o

10.1002/pmic.200300652

10.1002/pmic.200300485

10.1021/ac0155512

10.1074/mcp.T400004-MCP200

10.1074/mcp.M400031-MCP200

10.1038/nmeth785

10.1021/ac035229m

10.1021/pr070203n

10.1021/pr060102

10.1074/mcp.M500230-MCP200

10.1074/mcp.M500061-MCP200

10.1016/S1044-0305(02)00420-8

10.1021/ac048455k

10.1021/pr0155174

10.1021/ac970896z

10.1021/ac060143p

10.1016/S0021-9673(00)90564-8

10.1002/pmic.200400973

10.1073/pnas.77.3.1632

10.1021/ac0256890

10.1002/mas.1280060102

10.1021/ac00068a024

10.1002/1096-9888(200012)35:12<1399::AID-JMS86>3.0.CO;2-R

10.1021/ac049951b

10.1038/nbt930

10.1021/ac050857k

10.1101/gr.473902

10.1021/ac0498563

10.1021/ac0488513

10.1074/mcp.M500084-MCP200

10.1074/mcp.M500141-MCP200

10.1074/mcp.T400004-MCP200

10.1093/bioinformatics/bth092

10.1021/ac0700833

10.1021/pr800307m

10.1021/ac034616t

10.1021/ac0700272

10.1002/rcm.3150

10.1002/mas.1280140104

Li G.‐Z. Golick D. Gorenstein M. V. Vissers J. P. C.et al. A novel “ion accounting” algorithm for protein database searches. HUPO 5th Annual World Congress Long Beach CA2006 Abstract 658.

10.1074/mcp.M600303-MCP200

10.1172/JCI117084