Lists2Networks: Integrated analysis of gene/protein lists
Tóm tắt
Systems biologists are faced with the difficultly of analyzing results from large-scale studies that profile the activity of many genes, RNAs and proteins, applied in different experiments, under different conditions, and reported in different publications. To address this challenge it is desirable to compare the results from different related studies such as mRNA expression microarrays, genome-wide ChIP-X, RNAi screens, proteomics and phosphoproteomics experiments in a coherent global framework. In addition, linking high-content multilayered experimental results with prior biological knowledge can be useful for identifying functional themes and form novel hypotheses. We present Lists2Networks, a web-based system that allows users to upload lists of mammalian genes/proteins onto a server-based program for integrated analysis. The system includes web-based tools to manipulate lists with different set operations, to expand lists using existing mammalian networks of protein-protein interactions, co-expression correlation, or background knowledge co-annotation correlation, as well as to apply gene-list enrichment analyses against many gene-list libraries of prior biological knowledge such as pathways, gene ontology terms, kinase-substrate, microRNA-mRAN, and protein-protein interactions, metabolites, and protein domains. Such analyses can be applied to several lists at once against many prior knowledge libraries of gene-lists associated with specific annotations. The system also contains features that allow users to export networks and share lists with other users of the system. Lists2Networks is a user friendly web-based software system expected to significantly ease the computational analysis process for experimental systems biologists employing high-throughput experiments at multiple layers of regulation. The system is freely available at
http://www.lists2networks.org
.
Tài liệu tham khảo
Ma'ayan A: Network integration and graph analysis in mammalian molecular systems biology. IET Systems Biology 2008, 2(5):206–221. 10.1049/iet-syb:20070075
Nam D, Kim S-Y: Gene-set approach for expression pattern analysis. Brief Bioinform 2008, 9(3):189–197. 10.1093/bib/bbn001
Huang DW, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucl Acids Res 2009, 37(1):1–13. 10.1093/nar/gkn923
Mootha VK, Lindgren CM, Eriksson K-F, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstrale M, Laurila E, et al.: PGC-1[alpha]-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet 2003, 34(3):267–273. 10.1038/ng1180
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, et al.: Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences of the United States of America 2005, 102(43):15545–15550. 10.1073/pnas.0506580102
Gene Ontology Consortium: The Gene Ontology (GO) database and informatics resource. Nucl Acids Res 2004, 32(suppl_1):D258–261. 10.1093/nar/gkh036
Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucl Acids Res 1999, 27(1):29–34. 10.1093/nar/27.1.29
Doniger S, Salomonis N, Dahlquist K, Vranizan K, Lawlor S, Conklin B: MAPPFinder: using Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data. Genome Biology 2003, 4(1):R7. 10.1186/gb-2003-4-1-r7
Dennis G, Sherman B, Hosack D, Yang J, Gao W, Lane H, Lempicki R: DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biology 2003, 4(9):R60. 10.1186/gb-2003-4-9-r60
Masseroli M, Martucci D, Pinciroli F: GFINDer: Genome Function INtegrated Discoverer through dynamic annotation, statistical analysis, and mining. Nucl Acids Res 2004, 32(suppl-2):W293–300. 10.1093/nar/gkh432
Zhang B, Kirov S, Snoddy J: WebGestalt: an integrated system for exploring gene sets in various biological contexts. Nucl Acids Res 2005, 33(suppl_2):W741–748. 10.1093/nar/gki475
Hulsen T, de Vlieg J, Alkema W: BioVenn - a web application for the comparison and visualization of biological lists using area-proportional Venn diagrams. BMC Genomics 2008, 9(1):488. 10.1186/1471-2164-9-488
Fury W, Batliwalla F, Gregersen PK, Wentian L: Overlapping Probabilities of Top Ranking Gene Lists, Hypergeometric Distribution, and Stringency of Gene Selection Criterion. Engineering in Medicine and Biology Society, 2006 EMBS '06 28th Annual International Conference of the IEEE: 2006 2006, 5531–5534.
Ma'ayan A: Insights into the organization of biochemical regulatory networks using graph theory analyses. J Biol Chem 2008, 284(9):5451–5455. 10.1074/jbc.R800056200
Ma'ayan A, Blitzer RD, Iyengar R: TOWARD PREDICTIVE MODELS OF MAMMALIAN CELLS. Annual Review of Biophysics and Biomolecular Structure 2005, 34(1):319–349. 10.1146/annurev.biophys.34.040204.144415
Berger SI, Posner JM, Ma'ayan A: Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases. BMC Bioinformatics 2007, 8(1):372. 10.1186/1471-2105-8-372
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Research 2003, 13(11):2498–2504. 10.1101/gr.1239303
Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucl Acids Res 2006, 34(suppl_1):D535–539. 10.1093/nar/gkj109
Bader GD, Betel D, Hogue CWV: BIND: the Biomolecular Interaction Network Database. Nucl Acids Res 2003, 31(1):248–250. 10.1093/nar/gkg056
Mishra GR, Suresh M, Kumaran K, Kannabiran N, Suresh S, Bala P, Shivakumar K, Anuradha N, Reddy R, Raghavan TM, et al.: Human protein reference database--2006 update. Nucl Acids Res 2006, 34(suppl_1):D411–414. 10.1093/nar/gkj141
Hermjakob H, Montecchi-Palazzi L, Bader G, Wojcik J, Salwinski L, Ceol A, Moore S, Orchard S, Sarkans U, von Mering C, et al.: The HUPO PSI's Molecular Interaction format[mdash]a community standard for the representation of protein interaction data. 2004, 22(2):177–183.
Xenarios I, Rice DW, Salwinski L, Baron MK, Marcotte EM, Eisenberg D: DIP: the Database of Interacting Proteins. Nucl Acids Res 2000, 28(1):289–291. 10.1093/nar/28.1.289
Zanzoni A, Montecchi-Palazzi L, Quondam M, Ausiello G, Helmer-Citterich M, Cesareni G: MINT: a Molecular INTeraction database. FEBS Letters Protein Domains 2002, 513(1):135–140. 10.1016/S0014-5793(01)03293-8
Beuming T, Skrabanek L, Niv MY, Mukherjee P, Weinstein H: PDZBase: a protein-protein interaction database for PDZ-domains. Bioinformatics 2005, 21(6):827–828. 10.1093/bioinformatics/bti098
Grant SG: Systems biology in neuroscience: bridging genes to cognition. Current Opinion in Neurobiology 2003, 13(5):577–582. 10.1016/j.conb.2003.09.016
Rual J-F, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, et al.: Towards a proteome-scale map of the human protein-protein interaction network. 2005, 437(7062):1173–1178.
Stelzl U, Worm U, Lalowski M, Haenig C, Brembeck F, Goehler H, Stroedicke M, Zenkner M, Schoenherr A, Koeppen S: A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome. Cell 2005, 122(6):957–968. 10.1016/j.cell.2005.08.029
Ma'ayan A, Jenkins S, Neves S, Hasseldine A, Grace E, Dubin-Thaler B, Eungdamrong N, Weng G, Ram P, Rice J: Formation of Regulatory Patterns During Signal Propagation in a Mammalian Cellular Network. Science 2005, 309(5737):1078–1083. 10.1126/science.1108876
Obayashi T, Hayashi S, Shibaoka M, Saeki M, Ohta H, Kinoshita K: COXPRESdb: a database of coexpressed gene networks in mammals. Nucl Acids Res 2008, 36(suppl_1):D77–82.
Ogata H, Goto S, Fujibuchi W, Kanehisa M: Computation with the KEGG pathway database. Biosystems 1998, 47: 119–128. 10.1016/S0303-2647(98)00017-3
Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucl Acids Res 2006, 34(suppl_1):D140–144. 10.1093/nar/gkj112
Lachmann A, Ma'ayan A: KEA: kinase enrichment analysis. Bioinformatics 2009, 25: 684–686. 10.1093/bioinformatics/btp026
Wishart DS, Knox C, Guo AC, Eisner R, Young N, Gautam B, Hau DD, Psychogios N, Dong E, Bouatra S, et al.: HMDB: a knowledgebase for the human metabolome. Nucl Acids Res 2009, 37(suppl_1):D603–610. 10.1093/nar/gkn810
Hamosh A, Scott AF, Amberger J, Bocchini C, Valle D, McKusick VA: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucl Acids Res 2002, 30(1):52–55. 10.1093/nar/30.1.52
Cordeddu V, Di Schiavi E, Pennacchio LA, Ma'ayan A, Sarkozy A, Fodale V, Cecchetti S, Cardinale A, Martin J, Schackwitz W, et al.: Mutation of SHOC2 promotes aberrant protein N-myristoylation and causes Noonan-like syndrome with loose anagen hair. Nat Genet 2009, 41(9):1022–1026. 10.1038/ng.425
Lu R, Markowetz F, Unwin RD, Leek JT, Airoldi EM, MacArthur BD, Lachmann A, Rozov R, Ma/'ayan A, Boyer LA, et al.: Systems-level dynamic analyses of fate change in murine embryonic stem cells. Nature 2009, 462(7271):358–362. 10.1038/nature08575
Brill LM, Xiong W, Lee K-B, Ficarro SB, Crain A, Xu Y, Terskikh A, Snyder EY, Ding S: Phosphoproteomic Analysis of Human Embryonic Stem Cells. Cell Stem Cell 2009, 5(2):204–213. 10.1016/j.stem.2009.06.002
Van Hoof D, Muñoz J, Braam SR, Pinkse MWH, Linding R, Heck AJR, Mummery CL, Krijgsveld J: Phosphorylation Dynamics during Early Differentiation of Human Embryonic Stem Cells. Cell Stem Cell 2009, 5(2):214–226. 10.1016/j.stem.2009.05.021
Wang J, Rao S, Chu J, Shen X, Levasseur DN, Theunissen TW, Orkin SH: A protein interaction network for pluripotency of embryonic stem cells. Nature 2006, 444(7117):364–368. 10.1038/nature05284