More Is Better: Recent Progress in Multi-Omics Data Integration Methods

Sijia Huang1,2, Kumardeep Chaudhary1
1Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, United States
2Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI, United States

Tóm tắt

Từ khóa


Tài liệu tham khảo

Akavia, 2010, An integrated approach to uncover drivers of cancer, Cell, 143, 1005, 10.1016/j.cell.2010.11.013

Aure, 2013, Identifying in-trans process associated genes in breast cancer by integrated analysis of copy number and expression data, PLoS ONE, 8, e53014, 10.1371/journal.pone.0053014

Bonnet, 2015, Integrative multi-omics module network inference with Lemon-Tree, PLoS Comput. Biol., 11, e1003983, 10.1371/journal.pcbi.1003983

Chari, 2010, An integrative multi-dimensional genetic and epigenetic strategy to identify aberrant genes and pathways in cancer, BMC Syst. Biol., 4, 67, 10.1186/1752-0509-4-67

Chen, 2016, Integrative analysis for identifying joint modular patterns of gene-expression and drug-response data, Bioinformatics, 32, 1724, 10.1093/bioinformatics/btw059

Chen, 2013, Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis, Biostatistics, 14, 244, 10.1093/biostatistics/kxs038

Chin, 2011, Cancer genomics: from discovery science to personalized medicine, Nat. Med., 17, 297, 10.1038/nm.2323

Cho, 2013, Dissecting cancer heterogeneity with a probabilistic genotype-phenotype model, Nucleic Acids Res., 41, 8011, 10.1093/nar/gkt577

Cline, 2007, Integration of biological networks and gene expression data using Cytoscape, Nat. Protoc., 2, 2366, 10.1038/nprot.2007.324

Griffiths, 2005, Infinite latent feature models and the Indian buffet process, Advances in Neural Information Processing Systems 18 (NIPS 2005), 475

Hagerty, 2005, Communicating prognosis in cancer care: a systematic review of the literature, Ann. Oncol., 16, 1005, 10.1093/annonc/mdi211

Hofmann, 2008, Kernel methods in machine learning, Ann. Stat, 36, 1171, 10.1214/009053607000000677

Huang, 2016, Novel personalized pathway-based metabolomics models reveal key metabolic pathways for breast cancer diagnosis, Genome Med., 8, 34, 10.1186/s13073-016-0289-9

Huang, 2014, A novel model to combine clinical and pathway-based transcriptomic information for the prognosis prediction of breast cancer, PLoS Comput. Biol., 10, e1003851, 10.1371/journal.pcbi.1003851

Hudson, 2010, International network of cancer genome projects, Nature, 464, 993, 10.1038/nature08987

Ideker, 2002, Discovering regulatory and signalling circuits in molecular interaction networks, Bioinformatics, 18, S233, 10.1093/bioinformatics/18.suppl_1.S233

Imoto, 2004, Combining microarrays and biological knowledge for estimating gene networks via Bayesian networks, J. Bioinform. Comput. Biol., 2, 77, 10.1142/S021972000400048X

Jennings, 2013, Bayesian methods for expression-based integration of various types of genomics data, EURASIP J. Bioinforma. Syst. Biol., 2013, 13, 10.1186/1687-4153-2013-13

Kim, 2014, Data integration for cancer clinical outcome prediction, J. Heal. Med. Informatics, 5, e122, 10.4172/2157-7420.1000e122

Kim, 2013, ATHENA: Identifying interactions between different levels of genomic data associated with cancer clinical outcomes using grammatical evolution neural network, BioData Min., 6, 23, 10.1186/1756-0381-6-23

Kim, 2014, Knowledge-driven genomic interactions: an application in ovarian cancer, BioData Min., 7, 20, 10.1186/1756-0381-7-20

Kim, 2016, Using knowledge-driven genomic interactions for multi-omics data analysis: metadimensional models for predicting clinical outcomes in ovarian carcinoma, J. Am. Med. Inform. Assoc., 24, 577, 10.1093/jamia/ocw165

Kim, 2012, Synergistic effect of different levels of genomic data for cancer clinical outcome prediction, J. Biomed. Inform., 45, 1191, 10.1016/j.jbi.2012.07.008

Kirk, 2012, Bayesian correlated clustering to integrate multiple datasets, Bioinformatics, 28, 3290, 10.1093/bioinformatics/bts595

Lê Cao, 2009, Sparse canonical methods for biological data integration: application to a cross-platform study, BMC Bioinform., 10, 34, 10.1186/1471-2105-10-34

Lanckriet, 2004, A statistical framework for genomic data fusion, Bioinformatics, 20, 2626, 10.1093/bioinformatics/bth,294

Lee, 2001, Algorithms for non-negative matrix factorization, Advances in Neural Information Processing Systems, 556

Li, 2012, Identifying multi-layer gene regulatory modules from multi-dimensional genomic data, Bioinformatics, 28, 2458, 10.1093/bioinformatics/bts476

Lin, 2013, Group sparse canonical correlation analysis for genomic data integration, BMC Bioinform., 14, 245, 10.1186/1471-2105-14-245

Lock, 2013, Bayesian consensus clustering, Bioinformatics, 29, 2610, 10.1093/bioinformatics/btt425

Lock, 2013, Joint and individual variation explained (JIVE) for integrated analysis of multiple data types, Ann. Appl. Stat., 7, 523, 10.1214/12-AOAS597

Louhimo, 2011, CNAmet: an R package for integrating copy number, methylation and expression data, Bioinformatics, 27, 887, 10.1093/bioinformatics/btr019

Mankoo, 2011, Time to recurrence and survival in serous ovarian tumors predicted from integrated genomic profiles, PLoS ONE, 6, e24709, 10.1371/journal.pone.0024709

Mo, 2013, Pattern discovery and cancer gene identification in integrated cancer genomic data, Proc. Natl. Acad. Sci. U.S.A, 110, 4245, 10.1073/pnas.1208949110

Ovaska, 2010, Large-scale data integration framework provides a comprehensive view on glioblastoma multiforme, Genome Med., 2, 65, 10.1186/gm186

Parkhomenko, 2009, Sparse canonical correlation analysis with application to genomic data integration, Stat. Appl. Genet. Mol. Biol., 8, 1, 10.2202/1544-6115.1406

Rabin, 2013, Predicting cancer prognosis using interactive online tools: a systematic review and implications for cancer care providers, Cancer Epidemiol. Biomarkers Prev., 22, 1645, 10.1158/1055-9965.EPI-13-0513

Ray, 2014, Bayesian joint analysis of heterogeneous genomics data, Bioinformatics, 30, 1370, 10.1093/bioinformatics/btu064

Ritchie, 2015, Methods of integrating data to uncover genotype–phenotype interactions, Nat. Rev. Genet., 16, 85, 10.1038/nrg3868

Ruffalo, 2015, Network-based integration of disparate omic data to identify “Silent Players” in cancer, PLOS Comput. Biol., 11, e1004595, 10.1371/journal.pcbi.1004595

Schaefer, 2012, HIPPIE: integrating protein interaction networks with experiment based quality scores, PLoS ONE, 7, e31826, 10.1371/journal.pone.0031826

Seoane, 2014, A pathway-based data integration framework for prediction of disease progression, Bioinformatics, 30, 838, 10.1093/bioinformatics/btt610

Shen, 2012, Integrative subtype discovery in glioblastoma using iCluster, PLoS ONE, 7, e35236, 10.1371/journal.pone.0035236

Shen, 2009, Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis, Bioinformatics, 25, 2906, 10.1093/bioinformatics/btp543

Speicher, 2015, Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery, Bioinformatics, 31, i268, 10.1093/bioinformatics/btv244

Thibaux, 2007, Hierarchical beta processes and the indian buffet process, Artificial Intelligence and Statistics (AISTATS), 564

Tibshirani, 1994, Regression shrinkage and selection via the lasso, J. R. Stat. Soc. Ser. B, 58, 267

Tipping, 2001, Sparse Bayesian learning and the relevance vector machine, J. Mach. Learn. Res., 1, 211, 10.1162/15324430152748236

Van 't Veer, 2002, Gene expression profiling predicts clinical outcome of breast cancer, Nature, 415, 530, 10.1038/415530a

Vaske, 2010, Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM, Bioinformatics, 26, i237, 10.1093/bioinformatics/btq182

Wang, 2014, Similarity network fusion for aggregating data types on a genomic scale, Nat. Methods, 11, 333, 10.1038/nmeth.2810

Wei, 2016, Meta-dimensional data integration identifies critical pathways for susceptibility, tumorigenesis and progression of endometrial cancer, Oncotarget, 7, 55249, 10.18632/oncotarget.10509

Weinstein, 2013, The cancer genome atlas pan-cancer analysis project, Nat. Genet., 45, 1113, 10.1038/ng.2764

Witten, 2009, Extensions of sparse canonical correlation analysis with applications to genomic data, Stat. Appl. Genet. Mol. Biol., 8, 1, 10.2202/1544-6115.1470

You, 2010, A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network, BMC Bioinform., 11, 343, 10.1186/1471-2105-11-343

Yuan, 2011, Patient-specific data fusion defines prognostic cancer subtypes, PLoS Comput. Biol., 7, e1002227, 10.1371/journal.pcbi.1002227

Zhang, 2011, A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules, Bioinformatics, 27, i401, 10.1093/bioinformatics/btr206

Zhang, 2012, Discovery of multi-dimensional modules by integrative analysis of cancer genomic data, Nucleic Acids Res., 40, 9379, 10.1093/nar/gks725

Zhao, 2012, A Bayesian approach to discovering truth from conflicting sources for data integration, Proc. VLDB Endow., 5, 550, 10.14778/2168651.2168656