Prediction of MHC Class I Binding Peptides by a Query Learning Algorithm Based on Hidden Markov Models

Journal of Biological Physics - Tập 28 - Trang 183-194 - 2002
Keiko Udaka1, Hiroshi Mamitsuka2, Yukinobu Nakaseko1, Naoki Abe2
1Department of Biophysics, Kyoto University, Japan.
2Theory NEC Laboratory, RWCP (Real Worid Computing Partnership), c/o Internet Systems Research Laboratories, NEC corporation, Japan

Tóm tắt

A query learning algorithm based on hidden Markov models (HMMs) isdeveloped to design experiments for string analysis and prediction of MHCclass I binding peptides. Query learning is introduced to aim at reducingthe number of peptide binding data for training of HMMs. A multiple numberof HMMs, which will collectively serve as a committee, are trained withbinding data and used for prediction in real-number values. The universeof peptides is randomly sampled and subjected to judgement by the HMMs.Peptides whose prediction is least consistent among committee HMMs aretested by experiment. By iterating the feedback cycle of computationalanalysis and experiment the most wanted information is effectivelyextracted. After 7 rounds of active learning with 181 peptides in all,predictive performance of the algorithm surpassed the so far bestperforming matrix based prediction. Moreover, by combining the bothmethods binder peptides (log Kd < -6) could be predicted with84% accuracy. Parameter distribution of the HMMs that can be inspectedvisually after training further offers a glimpse of dynamic specificity ofthe MHC molecules.

Tài liệu tham khảo

Rammensee, H., Friede, T. and Stevanovic, S.: MHC ligands and peptide motifs: first listing, Immunogenetics 41 (1995), 178–228.

Hammer, J., et al.: Precise prediction of MHC class II-peptide interaction based on peptide side chain scanning, J. Exp. Med. 180 (1994), 2353–2358.

Udaka, K., Wiesmuller, K.-H., Kienle, S., Jung, G. and Walden, P.: Tolerance to amino acid variations in peptides binding to the MHC class I protein H-2Kb, J. Biol. Chem. 270 (1995), 24130–24134.

Udaka, K., Wiesmueller, K.-H., Kienle, S., Jung, G. and Walden, P.: Decrypting the structure of MHC-I restricted CTL epitopes with complex peptide libraries, J. Exp. Med. 181 (1995), 2097–2108.

Abe, N. and Mamitsuka, H.: In the fifteenth international conference on machine learning, Morgan Kaufmann, Madison, Wisconsin, 1998.

Mamitsuka, H. and Abe, N.: In the seventeenth international conference on machine learning, Morgan Kaufmann, Stanford, Ca., 2000.

Krogh, A., Brown, M., Mian, I.S., Sjolander, K. and Haussler, D.: Hidden Markov models in computational biology: applications to protein modeling, J. Mol. Biol. 235 (1994), 1501–1531.

Krogh, A., I.S., M. and D., H.: A hidden Markov model that finds genes in E. coli DNA, Nucl. Acids Res. 22 (1994), 4768–4778.