Function prediction and protein networks

Current Opinion in Cell Biology - Volume 15 - Pages 191-198 - 2003
Martijn A Huynen1, Berend Snel1, Christian von Mering2, Peer Bork2
1Nijmegen Center for Molecular Life Sciences, Center for Molecular and Biomolecular Informatics, Toernooiveld 1, 6525 ED Nijmegen, The Netherlands
2European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany

References

Bork, 1998, Predicting function: from genes to genomes and back, J. Mol. Biol., 283, 707, 10.1006/jmbi.1998.2144
Uetz, 2000, A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae, Nature, 403, 623, 10.1038/35001009
Gavin, 2002, Functional organization of the yeast proteome by systematic analysis of protein complexes, Nature, 415, 141, 10.1038/415141a
Marcotte, 1999, Detecting protein function and protein–protein interactions from genome sequences, Science, 285, 751, 10.1126/science.285.5428.751
Enright, 1999, Protein interaction maps for complete genomes based on gene fusion events, Nature, 402, 86, 10.1038/47056
Overbeek, 1998, Use of contiguity on the chromosome to predict functional coupling, In Silico Biol., 1, 93
Dandekar, 1998, Conservation of gene order: a fingerprint of proteins that physically interact, Trends Biochem. Sci., 23, 324, 10.1016/S0968-0004(98)01274-2
Pellegrini, 1999, Assigning protein functions by comparative genome analysis: protein phylogenetic profiles, Proc. Natl. Acad. Sci. USA, 96, 4285, 10.1073/pnas.96.8.4285
Huynen, 1998, Measuring genome evolution, Proc. Natl. Acad. Sci. USA, 95, 5849, 10.1073/pnas.95.11.5849
Pazos, 2001, Similarity of phylogenetic trees as indicator of protein-protein interaction, Protein Eng., 14, 609, 10.1093/protein/14.9.609
McGuire, 2000, Conservation of DNA regulatory motifs and discovery of new motifs in microbial genomes, Genome Res., 10, 744, 10.1101/gr.10.6.744
van Nimwegen, 2002, Probabilistic clustering of sequences: inferring new bacterial regulons by comparative genomics, Proc. Natl. Acad. Sci. USA, 99, 7323, 10.1073/pnas.112690399
Marcotte, 2000, Computational genetics: finding protein function by nonhomology methods, Curr. Opin. Struct. Biol., 10, 359, 10.1016/S0959-440X(00)00097-X
Galperin, 2000, Who’s your neighbor? New computational approaches for functional genomics, Nat. Biotechnol., 18, 609, 10.1038/76443
Valencia, 2002, Computational methods for the prediction of protein interactions, Curr. Opin. Struct. Biol., 12, 368, 10.1016/S0959-440X(02)00333-0
Huynen, 2000, Exploitation of gene context, Curr. Opin. Struct. Biol., 10, 366, 10.1016/S0959-440X(00)00098-1
Huynen, 2000, Predicting protein function by genomic context: quantitative evaluation and qualitative inferences, Genome Res., 10, 1204, 10.1101/gr.10.8.1204
Yanai, 2001, Genes linked by fusion events are generally of the same functional category: a systematic analysis of 30 microbial genomes, Proc. Natl. Acad. Sci. USA, 98, 7940, 10.1073/pnas.141236298
von Mering, 2003, STRING: a database of predicted functional associations between proteins, Nucleic Acids Res., 31, 258, 10.1093/nar/gkg034
Yanai, 2002, Identifying functional links between genes using conserved chromosomal proximity, Trends Genet., 18, 176, 10.1016/S0168-9525(01)02621-X
Bairoch, 2000, The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000, Nucleic Acids Res., 28, 45, 10.1093/nar/28.1.45
Kanehisa, 2002, The KEGG databases at GenomeNet, Nucleic Acids Res., 30, 42, 10.1093/nar/30.1.42
von Mering, 2002, Comparative assessment of large-scale data sets of protein-protein interactions, Nature, 417, 399, 10.1038/nature750
Eisenberg, 2000, Protein function in the post-genomic era, Nature, 405, 823, 10.1038/35015694
Huynen, Snel, 2003, Exploiting the variations in the genomic associations of genes to predict pathways and reconstruct their evolution, in Frontiers in Computational Genomics, edited by Galperin MY, Koonin EV, Norfolk: Caister Academic Press, 145-166
Campuzano, 1996, Friedreich’s ataxia: autosomal recessive disease caused by an intronic GAA triplet repeat expansion, Science, 271, 1423, 10.1126/science.271.5254.1423
Huynen, 2001, The phylogenetic distribution of frataxin indicates a role in iron-sulfur cluster protein assembly, Hum. Mol. Genet., 10, 2463, 10.1093/hmg/10.21.2463
Muhlenhoff, 2002, The yeast frataxin homolog Yfh1p plays a specific role in the maturation of cellular Fe/S proteins, Hum. Mol. Genet., 11, 2025, 10.1093/hmg/11.17.2025
Chen, 2002, Inhibition of Fe-S cluster biosynthesis decreases mitochondrial iron export: Evidence that Yfh1p affects Fe-S cluster synthesis, Proc. Natl. Acad. Sci. USA, 99, 12321, 10.1073/pnas.192449599
Duby, 2002, A non-essential function for yeast frataxin in iron-sulfur cluster assembly, Hum. Mol. Genet., 11, 2635, 10.1093/hmg/11.21.2635
Jeong, 2001, Lethality and centrality in protein networks, Nature, 411, 41, 10.1038/35075138
Fraser, 2002, Evolutionary rate in the protein interaction network, Science, 296, 750, 10.1126/science.1068696
Jeong, 2000, The large-scale organization of metabolic networks, Nature, 407, 651, 10.1038/35036627
Wolf, 2002, Scale-free networks in biology: new insights into the fundamentals of evolution?, Bioessays, 24, 105, 10.1002/bies.10059
Maslov, 2002, Specificity and stability in topology of protein networks, Science, 296, 910, 10.1126/science.1065103
Aloy, 2002, Potential artefacts in protein-interaction networks, FEBS Lett., 530, 253, 10.1016/S0014-5793(02)03427-0
Koonin, 2001, Prediction of the archaeal exosome and its connections with the proteasome and the translation and transcription machineries by a comparative-genomic approach, Genome Res., 11, 240, 10.1101/gr.162001
Snel, 2002, The identification of functional modules from the genomic association of genes, Proc. Natl. Acad. Sci. USA, 99, 5890, 10.1073/pnas.092632599
Rogozin, 2002, Connected gene neighborhoods in prokaryotic genomes, Nucleic Acids Res., 30, 2212, 10.1093/nar/30.10.2212
Overbeek, 2000, WIT: integrated system for high-throughput genome sequence analysis and metabolic reconstruction, Nucleic Acids Res., 28, 123, 10.1093/nar/28.1.123
Ravasz, 2002, Hierarchical organization of modularity in metabolic networks, Science, 297, 1551, 10.1126/science.1073374
Wu, 2002, Large-scale prediction of Saccharomyces cerevisiae gene function using overlapping transcriptional clusters, Nat. Genet., 31, 255, 10.1038/ng906
Ihmels, 2002, Revealing modular organization in the yeast transcriptional network, Nat. Genet., 31, 370, 10.1038/ng941
Snel, 2002, Conservation of gene co-regulation in prokaryotes and eukaryotes, Trends Biotechnol., 20, 410, 10.1016/S0167-7799(02)02040-1
Bobik, 2001, Identification of the human methylmalonyl-CoA racemase gene based on the analysis of prokaryotic gene arrangements. Implications for decoding the human genome, J. Biol. Chem., 276, 37194, 10.1074/jbc.M107232200
Weller, 2002, Identification of a DNA nonhomologous end-joining complex in bacteria, Science, 297, 1686, 10.1126/science.1074584
Blumenthal, 2002, A global analysis of Caenorhabditis elegans operons, Nature, 417, 851, 10.1038/nature00831
Roy, 2002, Chromosomal clustering of muscle-expressed genes in Caenorhabditis elegans, Nature, 418, 975, 10.1038/nature01012
Caron, 2001, The human transcriptome map: clustering of highly expressed genes in chromosomal domains, Science, 291, 1289, 10.1126/science.1056794
Lercher, 2002, Clustering of housekeeping genes provides a unified model of gene order in the human genome, Nat. Genet., 31, 180, 10.1038/ng887
Teichmann, 2002, Conservation of gene co-regulation in prokaryotes and eukaryotes, Trends Biotechnol., 20, 407, 10.1016/S0167-7799(02)02032-2
Hurst, 2002, Natural selection promotes the conservation of linkage of co-expressed genes, Trends Genet., 18, 604, 10.1016/S0168-9525(02)02813-5
Wolf, 2001, Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context, Genome Res., 11, 356, 10.1101/gr.GR-1619R
Tatusov, 2001, The COG database: new developments in phylogenetic classification of proteins from complete genomes, Nucleic Acids Res., 29, 22, 10.1093/nar/29.1.22
Lee, 2002, Transcriptional regulatory networks in Saccharomyces cerevisiae, Science, 298, 799, 10.1126/science.1075090
Milo, 2002, Network motifs: simple building blocks of complex networks, Science, 298, 824, 10.1126/science.298.5594.824
Thomas, 2000, The glnKamtB operon. A conserved gene pair in prokaryotes, Trends Genet., 16, 11, 10.1016/S0168-9525(99)01887-9
Coutts, 2002, Membrane sequestration of the signal transduction protein GlnK by the ammonium transporter AmtB, EMBO J., 21, 536, 10.1093/emboj/21.4.536
Horswill, 2001, In vitro conversion of propionate to pyruvate by Salmonella enterica enzymes: 2-methylcitrate dehydratase (PrpD) and aconitase enzymes catalyze the conversion of 2-methylcitrate to 2-methylisocitrate, Biochemistry, 40, 4703, 10.1021/bi015503b
Daugherty, 2001, Archaeal shikimate kinase, a new member of the GHMP-kinase family, J. Bacteriol., 183, 292, 10.1128/JB.183.1.292-300.2001
Graham, 2001, Identification of coenzyme M biosynthetic 2-phosphosulfolactate phosphatase. A member of a new class of Mg(2+)-dependent acid phosphatases, Eur. J. Biochem., 268, 5176, 10.1046/j.0014-2956.2001.02451.x
Luttgen, 2000, Biosynthesis of terpenoids: YchB protein of Escherichia coli phosphorylates the 2-hydroxy group of 4-diphosphocytidyl-2C-methyl-D-erythritol, Proc. Natl. Acad. Sci. USA, 97, 1062, 10.1073/pnas.97.3.1062
Karzai, 1999, SmpB, a unique RNA-binding protein essential for the peptide-tagging activity of SsrA (tmRNA), EMBO J., 18, 3793, 10.1093/emboj/18.13.3793
Myllykallio, 2002, An alternative flavin-dependent mechanism for thymidylate synthesis, Science, 297, 105, 10.1126/science.1072113
Rouhier, 2001, Isolation and characterization of a new peroxiredoxin from poplar sieve tubes that uses either glutaredoxin or thioredoxin as a proton donor, Plant Physiol., 127, 1299, 10.1104/pp.010586
Herz, 2000, Biosynthesis of terpenoids: YgbB protein converts 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate to 2C-methyl-D-erythritol 2,4-cyclodiphosphate, Proc. Natl. Acad. Sci. USA, 97, 2486, 10.1073/pnas.040554697
Kryukov, 2002, Selenoprotein R is a zinc-containing stereo-specific methionine sulfoxide reductase, Proc. Natl. Acad. Sci. USA, 99, 4245, 10.1073/pnas.072603099