Automatic clustering of orthologs and in-paralogs from pairwise species comparisons

Journal of Molecular Biology - Tập 314 - Trang 1041-1052 - 2001
Maido Remm1,2, Christian E.V. Storm1, Erik L.L. Sonnhammer1
1Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm Sweden
2Estonian Biocentre, Riia 23, Tartu, 51010, Estonia

Tài liệu tham khảo

Fitch, 1970, Distinguishing homologous from analogous proteins, Syst. Zool., 19, 99, 10.2307/2412448 Yuan, 1998, Towards detection of orthologues in sequence databases, Bioinformatics, 14, 285, 10.1093/bioinformatics/14.3.285 Chervitz, 1998, Comparison of the complete protein sets of worm and yeast, Science, 282, 2022, 10.1126/science.282.5396.2022 Rubin, 2000, Comparative genomics of the eukaryotes, Science, 287, 2204, 10.1126/science.287.5461.2204 Wheelan, 1999, Human and nematode orthologs - lessons from the analysis of 1800 human genes and the proteome of Caenorhabditis elegans, Gene, 238, 163, 10.1016/S0378-1119(99)00298-X Mushegian, 1998, Large-scale taxonomic profiling of eukaryotic model organisms, Genome Res., 8, 590, 10.1101/gr.8.6.590 Tatusov, 1996, Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli, Curr. Biol., 6, 279, 10.1016/S0960-9822(02)00478-5 Tatusov, 1997, A genomic perspective on protein families, Science, 278, 631, 10.1126/science.278.5338.631 Tatusov, 2001, The COG database, Nucl. Acids Res., 29, 22, 10.1093/nar/29.1.22 Tatusov, 2000, The COG database, Nucl. Acids Res., 28, 33, 10.1093/nar/28.1.33 Makalowski, 1998, Evolutionary parameters of the transcribed mammalian genome, Proc. Natl Acad. Sci. USA, 95, 9407, 10.1073/pnas.95.16.9407 Remm, 2000, Classification of transmembrane protein families in the Caenorhabditis elegans genome and identification of human orthologs, Genome Res., 10, 1679, 10.1101/gr.GR-1491R Zharkikh, 1995, Estimation of confidence in phylogeny, Mol. Phylogenet. Evol., 4, 44, 10.1006/mpev.1995.1005