Gene co-expression network construction and analysis for identification of genetic biomarkers associated with glioblastoma multiforme using topological findings

Seema Sandeep Redekar1,2, Satishkumar L. Varma1, Atanu Bhattacharjee3
1Pillai College of Engineering, Mumbai, India
2SIES Graduate School of Technology, Mumbai, India
3Leicester Real World Evidence Unit, University of Leicester, Leicester, UK

Tóm tắt

Glioblastoma multiforme (GBM) is one of the most malignant types of central nervous system tumors. GBM patients usually have a poor prognosis. Identification of genes associated with the progression of the disease is essential to explain the mechanisms or improve the prognosis of GBM by catering to targeted therapy. It is crucial to develop a methodology for constructing a biological network and analyze it to identify potential biomarkers associated with disease progression. Gene expression datasets are obtained from TCGA data repository to carry out this study. A survival analysis is performed to identify survival associated genes of GBM patient. A gene co-expression network is constructed based on Pearson correlation between the gene’s expressions. Various topological measures along with set operations from graph theory are applied to identify most influential genes linked with the progression of the GBM. Ten key genes are identified as a potential biomarkers associated with GBM based on centrality measures applied to the disease network. These genes are SEMA3B, APS, SLC44A2, MARK2, PITPNM2, SFRP1, PRLH, DIP2C, CTSZ, and KRTAP4.2. Higher expression values of two genes, SLC44A2 and KRTAP4.2 are found to be associated with progression and lower expression values of seven gens SEMA3B, APS, MARK2, PITPNM2, SFRP1, PRLH, DIP2C, and CTSZ are linked with the progression of the GBM. The proposed methodology employing a network topological approach to identify genetic biomarkers associated with cancer.

Tài liệu tham khảo

Ostrom QT, Patil N, Cioffi G, Waite K, Kruchko C, Barnholtz-Sloan JS. CBTRUS statistical report: primary brain and other central nervous system tumors diagnosed in the United States in 2013–2017. Neuro-oncology. 2020;22(1):iv1–96. Tan AC, Ashley DM, Lopez GY, Malinzak M, Friedman HS, Khasraw M. Management of glioblastoma: State of the art and future directions. CA. 2020;70(4):299–312. Dumitrescu RG. Epigenetic markers of early tumor development. Cancer Epigenetics. 2012;863:3–14. Hayat, M. A., ed. Tumors of the Central Nervous System, Volume 1: Gliomas: Glioblastoma (Part 1). Vol. 1. Springer Science & Business Media, 2011. Jain KK. A critical overview of targeted therapies for glioblastoma. Front Oncol. 2018;8:419. Yan W, Xue W, Chen J, Hu G. Biological networks for cancer candidate biomarkers discovery. Cancer Inform. 2016;15:CIN-S39458. Kim YW, Koul D, Kim SH, Lucio-Eterovic AK, Freire PR, Yao J, Wang J, Almeida JS, Aldape K, Yung WA. Identification of prognostic gene signatures of glioblastoma: a study based on TCGA data analysis. Neuro-oncology. 2013;15(7):829–39. Redekar SS, Varma SL. A survey on community detection methods and its application in biological network. In 2022 International Conference on Applied Artificial Intelligence and Computing (ICAAIC), pp. 1030–1037. IEEE, 2022. Sarmah T, Bhattacharyya DK. A study of tools for differential co-expression analysis for RNA-Seq data. Inform Med Unlocked. 2021;26:100740. Saberi M, Khosrowabadi R, Khatibi A, Misic B, Jafari G. Topological impact of negative links on the stability of resting-state brain network. Sci Rep. 2021;11(1):1–14. Negre CFA, Morzan UN, Hendrickson HP, Pal R, Lisi GP, Loria JP, Rivalta I, Ho J, Batista VS. Eigenvector centrality for characterization of protein allosteric pathways. Proceed Natl Acad Sci. 2018;115(52):E12201–8. Kardos O, London A, Vinko T. Stability of network centrality measures: a numerical study. Soc Netw Anal Min. 2020;10(1):1–17. Tulu MM, Hou R, Younas T. Identifying influential nodes based on community structure to speed up the dissemination of information in complex network. IEEE Access. 2018;6:7390–401. Laeuchli J, Ramirez-Cruz Y, Trujillo-Rasua R. Analysis of centrality measures under differential privacy models. Appl Math Comput. 2022;412:126546. Landherr A, Friedl B, Heidemann J. A critical review of centrality measures in social networks. Bus Inf Syst Eng. 2010;2(6):371–85. Das T, Andrieux G, Ahmed M, Chakraborty S. Integration of online omics-data resources for cancer research. Front Genet. 2020;11:578345. Tadist K, Najah S, Nikolov NS, Mrabti F, Zahi A. Feature selection methods and genomic big data: a systematic review. J Big Data. 2019;6(1):1–24. Navarro FCP, Mohsen H, Yan C, Li S, Gu M, Meyerson W, Gerstein M. Genomics and data science: an application within an umbrella. Gen Biol. 2019;20(1):1–11. Huang H-H, Liang Y. Hybrid L1/2+ 2 method for gene selection in the Cox proportional hazards model. Comput Methods Programs Biomed. 2018;164:65–73. Emura T, Matsui S, Chen HY. compound. Cox: univariate feature selection and compound covariate for predicting survival. Comput Methods Programs Biomed. 2019;168:21–37. Redekar SS, Varma SL, Bhattacharjee A. Identification of key genes associated with survival of glioblastoma multiforme using integrated analysis of TCGA datasets. Comput Methods Programs Biomed Update. 2022;2:100051. Tieri P, Farina L, Petti M, Astolfi L, Paci P, Castiglione F. Network inference and reconstruction in bioinformatics. 2019;2:805–13. Liu W, Li Li, Long X, You W, Zhong Y, Wang M, Tao H, Lin S, He H. Construction and analysis of gene co-expression networks in Escherichia coli. Cells. 2018;7(3):19. Guo D, Zhang S, Tang Z, Wang H. Construction of gene-classifier and co-expression network analysis of genes in association with major depressive disorder. Psychiatry Res. 2020;293:113387. Swarup R, Bhattacharyya DK, Kalita JK. Reconstruction of gene co-expression network from microarray data using local expression patterns. BMC Bioinformatics. 2014;15(7):S10. Jiang J, Sun X, Wu W, Li L, Wu H, Zhang L, Yu G, Li Y. Construction and application of a co-expression network in Mycobacterium tuberculosis. Scientific Rep. 2016;6:28422. Panditrao G, Bhowmick R, Meena C, Sarkar RR. Emerging landscape of molecular interaction networks: opportunities, challenges and prospects. J Biosci. 2022;47(2):1–26. Wang P. Statistical identification of important nodes in biological systems. J Syst Sci Complexity. 2021;34(4):1454–70. Puth M-T, Neuhauser M, Ruxton GD. Effective use of Spearman’s and Kendall’s correlation coefficients for association between two measured traits. Anim Behav. 2015;102:77–84. Wang J. On the relationship between Pearson correlation coefficient and Kendall’s tau under bivariate homogeneous shock model. International Scholarly Research Notices 2012:2012. He W, Peng D, Tkachenko M, Xiao Z. Gaps in lattices of (para) topological group topologies and cardinal functions. Topolo Applications. 2019;264:89–104. Farhadian M, Rafat SA, Panahi B, Mayack C. Weighted gene co-expression network analysis identifies modules and functionally enriched pathways in the lactation process. Scientific Rep. 2021;11(1):1–15. Singh A, Singh RR, Iyengar SRS. Node-weighted centrality: a new way of centrality hybridization. Comput Soc Net. 2020;7(1):1–33. Voigt A, Almaas E. Assessment of weighted topological overlap (wTO) to improve fidelity of gene co-expression networks. BMC Bioinformatics. 2019;20(1):1–11. Sahoo R, Rani TS, Bhavani SD. Differentiating cancer from normal protein-protein interactions through network analysis. Emerging Trends in Computer Science and Applied Computing. Chapter 17. 2016. p. 253–269. Ghiasi MM, Zendehboudi S, Mohsenipour AA. Decision tree-based diagnosis of coronary artery disease: CART model. Comput Methods Programs Biomed. 2020;192:105400.