Maftools: efficient and comprehensive analysis of somatic variants in cancer
Tóm tắt
Numerous large-scale genomic studies of matched tumor-normal samples have established the somatic landscapes of most cancer types. However, the downstream analysis of data from somatic mutations entails a number of computational and statistical approaches, requiring usage of independent software and numerous tools. Here, we describe an R Bioconductor package, Maftools, which offers a multitude of analysis and visualization modules that are commonly used in cancer genomic studies, including driver gene identification, pathway, signature, enrichment, and association analyses. Maftools only requires somatic variants in Mutation Annotation Format (MAF) and is independent of larger alignment files. With the implementation of well-established statistical and computational methods, Maftools facilitates data-driven research and comparative analysis to discover novel results from publicly available data sets. In the present study, using three of the well-annotated cohorts from The Cancer Genome Atlas (TCGA), we describe the application of Maftools to reproduce known results. More importantly, we show that Maftools can also be used to uncover novel findings through integrative analysis.
Từ khóa
Tài liệu tham khảo
2016, Kataegis expression signature in breast cancer is associated with late onset, better prognosis, and higher HER2 levels, Cell Rep, 16, 672, 10.1016/j.celrep.2016.06.026
2018, Scalable open science approach for mutation calling of tumor exomes using multiple genomic pipelines, Cell Syst, 6, 271, 10.1016/j.cels.2018.03.002
2014, ErbB targeting inhibitors repress cell migration of esophageal squamous cell carcinoma and adenocarcinoma cells by distinct signaling pathways, J Mol Med (Berl), 92, 1209, 10.1007/s00109-014-1187-5
2016, Spatial intratumoral heterogeneity and temporal clonal evolution in esophageal squamous cell carcinoma, Nat Genet, 48, 1500, 10.1038/ng.3683
2015, Voltage-gated Na+ channel activity increases colon cancer transcriptional activity and invasion via persistent MAPK signaling, Sci Rep, 5, 11541, 10.1038/srep11541
2018, Genomic and epigenomic aberrations in esophageal squamous cell carcinoma and implications for patients, Gastroenterology, 154, 374, 10.1053/j.gastro.2017.06.066
R Core Team. 2018. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/ .
2016, Mutational signatures in esophageal adenocarcinoma define etiologically distinct subgroups with therapeutic relevance, Nat Genet, 48, 1131, 10.1038/ng.3659