Accurate Prediction of Protein Secondary Structural Content

Journal of Protein Chemistry - Vol. 20 - pp. 217-220 - 2001
Zong Lin (1), Xian-Ming Pan (1)
(1) National Laboratory of Biomacromolecules, Institute of Biophysics, Academia Sinica, Beijing, China

Abstract

An improved multiple linear regression (MLR) method is proposed to predict a protein's secondary structural content from its primary sequence. The method takes into account the amino acid composition, the autocorrelation function, and the interaction function of side-chain mass, all derived from the primary sequence. In a jackknife test over 704 unrelated proteins, the average absolute errors of prediction are 0.088, 0.081, and 0.059, with standard deviations of 0.073, 0.066, and 0.055, for α-helix, β-sheet, and coil, respectively. Because the predicted secondary structure contents should sum to approximately 1.0, this sum is introduced as a criterion for judging whether a prediction is acceptable. When only predictions whose sum lies between 0.99 and 1.01 are accepted (about 11% of all proteins), the absolute errors fall to 0.058 for α-helix, 0.054 for β-sheet, and 0.045 for coil.
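The acceptance criterion and the error measure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`composition`, `is_acceptable`, `mean_absolute_error`) are hypothetical, only the amino acid composition feature is shown, and the autocorrelation and interaction functions of side-chain mass are omitted because the abstract does not define them. The regression fit itself is also left out.

```python
from collections import Counter

# The 20 standard residues in one-letter code (assumed alphabet for this sketch).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence):
    """Return the fraction of each standard amino acid in a sequence."""
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard residues")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


def is_acceptable(helix, sheet, coil, low=0.99, high=1.01):
    """Accept a prediction only if the three predicted fractions sum to
    a value between `low` and `high` (0.99 and 1.01 in the abstract)."""
    return low <= helix + sheet + coil <= high


def mean_absolute_error(predicted, observed):
    """Average absolute difference between predicted and observed contents."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - o) for p, o in zip(predicted, observed)) / len(predicted)
```

Each content type is predicted separately, so nothing forces the three values to sum to 1.0; a sum far from 1.0 therefore flags a prediction as unreliable. For example, `is_acceptable(0.5, 0.3, 0.3)` rejects a prediction whose fractions sum to 1.1.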

References