Conserved residue clustering and protein structure prediction

Proteins: Structure, Function and Bioinformatics - Tập 52 Số 2 - Trang 225-235 - 2003
Ora Schueler‐Furman1, David Baker1,2
1Department of Biochemistry, University of Washington, Seattle, Washington
2Howard Hughes Medical Institute, University of Washington, Seattle, Washington

Tóm tắt

AbstractProtein residues that are critical for structure and function are expected to be conserved throughout evolution. Here, we investigate the extent to which these conserved residues are clustered in three‐dimensional protein structures. In 92% of the proteins in a data set of 79 proteins, the most conserved positions in multiple sequence alignments are significantly more clustered than randomly selected sets of positions. The comparison to random subsets is not necessarily appropriate, however, because the signal could be the result of differences in the amino acid composition of sets of conserved residues compared to random subsets (hydrophobic residues tend to be close together in the protein core), or differences in sequence separation of the residues in the different sets. In order to overcome these limits, we compare the degree of clustering of the conserved positions on the native structure and on alternative conformations generated by the de novo structure prediction method Rosetta. For 65% of the 79 proteins, the conserved residues are significantly more clustered in the native structure than in the alternative conformations, indicating that the clustering of conserved residues in protein structures goes beyond that expected purely from sequence locality and composition effects. The differences in the spatial distribution of conserved residues can be utilized in de novo protein structure prediction: We find that for 79% of the proteins, selection of the Rosetta generated conformations with the greatest clustering of the conserved residues significantly enriches the fraction of close‐to‐native structures. Proteins 2003;52:225–235. © 2003 Wiley‐Liss, Inc.

Từ khóa


Tài liệu tham khảo

10.1017/S0033583500004674

10.1126/science.7529940

10.1016/S0959-440X(00)00216-5

10.1016/S0959-440X(02)00283-X

10.1093/protein/2.8.589

10.1006/jmbi.1997.1198

10.1002/prot.340180402

10.1016/S1359-0278(97)00060-6

10.1038/nsb0295-171

Livingstone CD, 1993, Protein sequence alignments: A strategy for the hierarchical analysis of residue conservation, Comput Appl Biosci, 9, 745

10.1006/jmbi.1996.0167

10.1016/S0959-440X(02)00284-1

10.1006/jmbi.2001.4870

10.1006/jmbi.2000.4474

10.1006/jmbi.2001.4540

10.1046/j.1432-1033.2002.02767.x

10.1002/(SICI)1097-0134(20000601)39:4<331::AID-PROT60>3.0.CO;2-A

Ouzounis C, 1998, Are binding residues conserved?, Pac Symp Biocomput, 401

10.1006/jmbi.2001.5327

10.1006/jmbi.1999.3208

10.1038/358086a0

10.1002/prot.340130308

10.1016/S0959-440X(96)80076-5

10.1016/S0959-440X(97)80055-3

10.1002/prot.10053

10.1016/S0959-440X(00)00063-4

10.1016/S1359-0278(96)00048-X

10.1093/protein/6.6.605

10.1002/pro.5560040506

10.1006/jmbi.1997.1595

10.1023/A:1026744431105

10.1006/jmbi.1997.0959

10.1006/jmbi.1999.2702

DunbrackRLJ WangG.PISCES: A protein sequence culling server. Submitted for publication.

10.1093/nar/29.1.214

10.1006/jmbi.2000.4459

10.1093/nar/25.17.3389

10.1002/j.1538-7305.1948.tb01338.x

10.1073/pnas.84.13.4355

10.1002/prot.10146

10.1006/jmbi.1993.1240

10.1016/S0969-2126(98)00045-8

10.1016/S0968-0004(00)89080-5