Prediction of reversibly oxidized protein cysteine thiols using protein structure properties

Protein Science - Tập 17 Số 3 - Trang 473-481 - 2008
Ricardo Sánchez1, Megan Riddle1, Jongwook Woo2, Jamil Momand3,1
1Department of Chemistry and Biochemistry, California State University Los Angeles, California 90032 USA
2Department of Computer Information Systems, California State University, Los Angeles, California 90032, USA
3Department of Chemistry and Biochemistry, California State University, 5151 State University Drive, Los Angeles, CA 90032, USA; fax:(323) 343‐2499.

Tóm tắt

Abstract

Protein cysteine thiols can be divided into four groups based on their reactivities: those that form permanent structural disulfide bonds, those that coordinate with metals, those that remain in the reduced state, and those that are susceptible to reversible oxidation. Physicochemical parameters of oxidation‐susceptible protein thiols were organized into a database named the Balanced Oxidation Susceptible Cysteine Thiol Database (BALOSCTdb). BALOSCTdb contains 161 cysteine thiols that undergo reversible oxidation and 161 cysteine thiols that are not susceptible to oxidation. Each cysteine was represented by a set of 12 parameters, one of which was a label (1/0) to indicate whether its thiol moiety is susceptible to oxidation. A computer program (the C4.5 decision tree classifier re‐implemented as the J48 classifier) segregated cysteines into oxidation‐susceptible and oxidation‐non‐susceptible classes. The classifier selected three parameters critical for prediction of thiol oxidation susceptibility: (1) distance to the nearest cysteine sulfur atom, (2) solvent accessibility, and (3) pKa. The classifier was optimized to correctly predict 136 of the 161 cysteine thiols susceptible to oxidation. Leave‐one‐out cross‐validation analysis showed that the percent of correctly classified cysteines was 80.1% and that 16.1% of the oxidation‐susceptible cysteine thiols were incorrectly classified. The algorithm developed from these parameters, named the Cysteine Oxidation Prediction Algorithm (COPA), is presented here. COPA prediction of oxidation‐susceptible sites can be utilized to locate protein cysteines susceptible to redox‐mediated regulation and identify possible enzyme catalytic sites with reactive cysteine thiols.

Từ khóa


Tài liệu tham khảo

10.1016/S0022-2836(05)80360-2

10.1073/pnas.96.11.6161

10.2741/1187

10.1152/ajpheart.01162.2006

10.1093/nar/gkl266

10.1016/S0092-8674(01)00300-2

10.1096/fj.05-4875fje

10.1016/0167-4889(87)90112-1

10.1021/bi020200i

10.1186/1471-2091-7-18

10.1093/nar/gkl189

10.1016/S0955-0674(03)00002-4

10.1093/bioinformatics/16.3.251

10.1073/pnas.102483199

10.1042/bj20011251

10.1042/BST0331378

10.1006/abbi.1998.1013

10.1146/annurev.biochem.72.121801.161459

10.1016/j.molmed.2005.10.006

10.1002/0471687545

10.1016/0022-2836(71)90324-X

10.1074/jbc.M111899200

10.1038/nsmb856

10.1002/prot.20660

10.1089/152308603768295168

10.1139/o06-186

10.1042/bj3550145

10.1515/BC.2002.067

10.1110/ps.0219602

10.1093/nar/gki365

10.1093/nar/gkh376

10.1046/j.1365-2958.2002.03116.x

10.1146/annurev.pharmtox.44.101802.121735

Quinlan J.R., 1993, C4.5: Programs for machine learning

10.1613/jair.279

Research Collaboratory for Structural Bioinformatics2003.RCSB PDB Newsletter.http://www.resb.org/pdb/static.do?p=general_information/news_publications/newsletters/2002q4/index.html.

2007.RCSB PDB Newsletter.http://www.rcsb.org/pdb/general_information/news_publications/newsletters/2006q4/deposition.html#satistics.

10.1006/abbi.1995.1261

Witten I.A., 2005, Data mining: Practical machine learning tools and techniques

10.1126/science.1080273

10.1126/science.1080405

10.1021/bi060711v

10.1021/bi00087a012

10.1126/science.279.5357.1718