Integrative analysis of gene expression and methylation data for breast cancer cell lines

BioData Mining - Tập 11 - Trang 1-16 - 2018
Catherine Li1, Juyon Lee2, Jessica Ding3, Shuying Sun4
1Westwood High School, Austin, USA
2Korea International School Pangyo Campus, Seongnam, South Korea
3Liberal Arts and Science Academy, Austin, USA
4Department of Mathematics, Texas State University, San Marcos, USA

Tóm tắt

The deadly costs of cancer and necessity for an accurate method of early cancer detection have demanded the identification of genetic and epigenetic factors associated with cancer. DNA methylation, an epigenetic event, plays an important role in cancer susceptibility. In this paper, we use DNA methylation and gene expression data integration and pathway analysis to further explore and understand the complex relationship between methylation and gene expression. Through linear modeling and analysis of variance, we obtain genes that show a significant correlation between methylation and gene expression. We then examine the functions and relationships of these genes using bioinformatic tools and databases. In particular, using ConsensusPathDB, we analyze the networks of statistically significant genes to identify hub genes, genes with a large number of links to other genes. We identify eight major hub genes, all in strong association with cancer susceptibility. Through further analysis of the function, gene expression level, and methylation level of these hub genes, we conclude that they are novel potential biomarkers for breast cancer. Our findings have various implications for cancer screening, early detection methods, and potential novel treatments for cancer. Researchers can also use our results to develop more effective methods for cancer study.

Tài liệu tham khảo

Lim DH, Maher E. DNA methylation: a form of epigenetic control of gene expression. The Obstetrician & Gynaecologist. 2010;12:6. Neve RM, Chin K, Fridlyand J, Yeh J, Baehner FL, Fevr T, Clark L, Bayani N, Coppe JP, Tong F, et al. A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. Cancer Cell. 2006;10(6):515–27. Cho YH, McCullough LE, Gammon MD, Wu HC, Zhang YJ, Wang Q, Xu X, Teitelbaum SL, Neugut AI, Chen J, et al. Promoter Hypermethylation in white blood cell DNA and breast Cancer risk. J Cancer. 2015;6(9):819–24. Xu L, Mitra-Behura S, Alston B, Zong Z, Sun S. Identifying DNA methylation variation patterns to obtain potential breast Cancer biomarker genes. Int J Biomed Data Mining. 2015;4(115) Yang X, Yan L, Davidson NE. DNA methylation in breast cancer. Endocr Relat Cancer. 2001;8(2):115–27. Gomez-Cabrero D, Abugessaisa I, Maier D, Teschendorff A, Merkenschlager M, Gisel A, Ballestar E, Bongcam-Rudloff E, Conesa A, Tegner J. Data integration in the era of omics: current and future challenges. BMC Syst Biol. 2014;8(Suppl 2):I1. Huang S, Chaudhary K, Garmire LX. More is better: recent progress in multi-Omics data integration methods. Front Genet. 2017;8:84. Sun Z, Asmann YW, Kalari KR, Bot B, Eckel-Passow JE, Baker TR, Carr JM, Khrebtukova I, Luo S, Zhang L, et al. Integrated analysis of gene expression, CpG island methylation, and gene copy number in breast cancer cells by deep sequencing. PLoS One. 2011;6(2):e17490. Li W, Zhang S, Liu CC, Zhou XJ. Identifying multi-layer gene regulatory modules from multi-dimensional genomic data. Bioinformatics. 2012;28(19):2458–66. Louhimo R, Hautaniemi S. CNAmet: an R package for integrating copy number, methylation and expression data. Bioinformatics. 2011;27(6):887–8. Sun S, Huang YW, Yan PS, Huang TH, Lin S. Preprocessing differential methylation hybridization microarray data. BioData Min. 2011;4:13. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4(2):249–64. Huang TH, Perry MR, Laux DE. Methylation profiling of CpG islands in human breast cancer cells. Hum Mol Genet. 1999;8(3):459–70. Yan PS, Chen CM, Shi H, Rahmatpanah F, Wei SH, Huang TH. Applications of CpG island microarrays for high-throughput analysis of DNA methylation. J Nutr. 2002;132(8 Suppl):2430S–4S. Yan PS, Potter D, Deatherage DE, Huang TH, Lin S. Differential methylation hybridization: profiling DNA methylation with a high-density CpG island microarray. Methods Mol Biol. 2009;507:89–106. Smyth GK, Speed T. Normalization of cDNA microarray data. Methods. 2003;31(4):265–73. GeneCards - Gene Database (www.genecards.org). Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. Kamburov A, Pentchev K, Galicka H, Wierling C, Lehrach H, Herwig R. ConsensusPathDB: toward a more complete picture of cell biology. Nucleic Acids Res. 2011;39(Database issue):D712–7. Kamburov A, Stelzl U, Lehrach H, Herwig R. The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res. 2013;41(Database issue):D793–800. Kamburov A, Wierling C, Lehrach H, Herwig R. ConsensusPathDB--a database for integrating human functional interaction networks. Nucleic Acids Res. 2009;37(Database issue):D623–8. Chari R, Coe BP, Vucic EA, Lockwood WW, Lam WL. An integrative multi-dimensional genetic and epigenetic strategy to identify aberrant genes and pathways in cancer. BMC Syst Biol. 2010;4:67. Wang X, Zang M, Xiao G. Epigenetic change detection and pattern recognition via Bayesian hierarchical hidden Markov models. Stat Med. 2013;32(13):16. Xiao G, Wang X, Khodursky AB. Modeling three-dimensional chromosome structures using gene expression data. J Am Stat Assoc. 2011;106(493):12. Hinshelwood RA, Clark SJ. Breast cancer epigenetics: normal human mammary epithelial cells as a model system. J Mol Med. 2008;86(12):14.