Systematic Evaluation of Molecular Networks for Discovery of Disease Genes

Cell Systems - Volume 6 - Pages 484-495.e5 - 2018
Justin K. Huang1, Daniel E. Carlin2, Michael Ku Yu1,2, Wei Zhang2, Jason F. Kreisberg2, Pablo Tamayo2,3, Trey Ideker1,2,3
1Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
2School of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
3Moores Cancer Center, University of California, San Diego, La Jolla, CA 92093, USA
