Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures

Springer Science and Business Media LLC - Tập 7 - Trang 1-13 - 2014
Masanari Matsuoka1,2, Masatake Sugita1, Takeshi Kikuchi1
1Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, Kusatsu, Japan
2Japan Society for the Promotion of Science (JSPS) Research Fellow DC2, Tokyo, Japan

Tóm tắt

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.

Tài liệu tham khảo

Daggett V, Fersht AR: Transition states in protein folding. Mechanisms of protein folding. Edited by: Pain RH. 2000, New York: Oxford University Press, 175-211. 2