Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network

Lei Chen1,2,3, XiaoYong Pan4, Yu-Hang Zhang5, Min Liu2, Tao Huang5, Yu-Dong Cai1
1School of Life Sciences, Shanghai University, Shanghai 200444, People's Republic of China
2College of Information Engineering, Shanghai Maritime University, Shanghai 201306, People's Republic of China
3Shanghai Key Laboratory of PMMP, East China Normal University, Shanghai 200241, People's Republic of China
4Department of Medical Informatics, Erasmus MC, Rotterdam, the Netherlands
5Institute of Health Sciences, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, People's Republic of China

Tài liệu tham khảo

Filipp, 2017, Precision medicine driven by cancer systems biology, Cancer Metastasis Rev, 36, 91, 10.1007/s10555-017-9662-4 Archer, 2016, Systems approaches to cancer biology, Cancer Res, 76, 6774, 10.1158/0008-5472.CAN-16-1580 Disease, 2017, Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: a systematic analysis for the global burden of disease study 2016, Lancet, 390, 1211, 10.1016/S0140-6736(17)32154-2 Disease, 2016, Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015: a systematic analysis for the global burden of disease study 2015, Lancet, 388, 1545, 10.1016/S0140-6736(16)31678-6 Global Burden of Disease Study, C, 2015, Global, regional, and national incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in 188 countries, 1990-2013: a systematic analysis for the global burden of disease study 2013, Lancet, 386 Mortimer, 2014, New developments in clinical aspects of lymphatic disease, J Clin Invest, 124, 915, 10.1172/JCI71608 Aune, 2016, Nut consumption and risk of cardiovascular disease, total cancer, all-cause and cause-specific mortality: a systematic review and dose-response meta-analysis of prospective studies, BMC Med, 14, 207, 10.1186/s12916-016-0730-3 Loven, 2012, Revisiting global gene expression analysis, Cell, 151, 476, 10.1016/j.cell.2012.10.012 Holt, 2013, The central dogma decentralized: new perspectives on RNA function and local translation in neurons, Neuron, 80, 648, 10.1016/j.neuron.2013.10.036 Bagger, 2013, HemaExplorer: a database of mRNA expression profiles in normal and malignant haematopoiesis, Nucleic Acids Res, 41, D1034, 10.1093/nar/gks1021 Zheng, 2014, Comparative analysis of gene expression profiles in basal-like carcinomas of the breast, Anal Quant Cytopathol Histpathol, 36, 82 Bergthold, 2015, Expression profiles of 151 pediatric low-grade gliomas reveal molecular differences associated with location and histological subtype, Neuro Oncol, 17, 1486, 10.1093/neuonc/nov045 Medh, 2002, Microarray-based expression profiling of normal and malignant immune cells, Endocr Rev, 23, 393, 10.1210/edrv.23.3.0470 Liang, 2014, Single-cell sequencing technologies: current and future, J Genet Genomics, 41, 513, 10.1016/j.jgg.2014.09.005 Schotte, 2011, Discovery of new microRNAs by small RNAome deep sequencing in childhood acute lymphoblastic leukemia, Leukemia, 25, 1389, 10.1038/leu.2011.105 Jacob, 1989, Polyadenylate polymerases from normal and cancer cells and their potential role in messenger RNA processing: a review, Cancer Res, 49, 2827 Alizadeh, 2000, Genomic-scale gene expression profiling of normal and malignant immune cells, Curr Opin Immunol, 12, 219, 10.1016/S0952-7915(99)00078-3 Gene Ontology, 2015, Gene ontology consortium: going forward, Nucleic Acids Res, 43, D1049, 10.1093/nar/gku1179 Kanehisa, 2016, KEGG as a reference resource for gene and protein annotation, Nucleic Acids Res, 44, D457, 10.1093/nar/gkv1070 Chen, 2015, Integrating GO and KEGG terms to characterize and predict acute myeloid leukemia-related genes, Hematology, 20, 336, 10.1179/1607845414Y.0000000209 Padmanabhan, 2012, Functional annotation of hierarchical modularity, PLoS One, 7, 10.1371/journal.pone.0033744 Uhlen, 2017, A pathology atlas of the human cancer transcriptome, Science, 357 Peng, 2005, Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy, IEEE Trans Pattern Anal Mach Intell, 27, 1226, 10.1109/TPAMI.2005.159 Liu, 1998, Incremental feature selection, Appl Intell, 9, 217, 10.1023/A:1008363719778 Hochreiter, 1997, Long short-term memory, Neural Comput, 9, 1735, 10.1162/neco.1997.9.8.1735 Carmona-Saez, 2007, GENECODIS: a web-based tool for finding significant concurrent annotations in gene lists, Genome Biol, 8, R3, 10.1186/gb-2007-8-1-r3 Huang, 2012, SySAP: a system-level predictor of deleterious single amino acid polymorphisms, Protein Cell, 3, 38, 10.1007/s13238-011-1130-2 Szklarczyk, 2015, STRING v10: protein-protein interaction networks, integrated over the tree of life, Nucleic Acids Res, 43, D447, 10.1093/nar/gku1003 Kononenko, 1997, Overcoming the myopia of inductive learning algorithms with RELIEFF, Appl Intell, 7, 39, 10.1023/A:1008280620621 Zou, 2016, A novel features ranking metric with application to scalable visual and bioinformatics data classification, Neurocomputing, 173, 346, 10.1016/j.neucom.2014.12.123 Liu, 2017, Analysis and prediction of drug-drug interaction by minimum redundancy maximum relevance and incremental feature selection, J Biomol Struct Dyn, 35, 312, 10.1080/07391102.2016.1138142 Chen, 2017, Analysis of cancer-related lncRNAs using gene ontology and KEGG pathways, Artif Intell Med, 76, 27, 10.1016/j.artmed.2017.02.001 Chen, 2018, Discriminating cirRNAs from other lncRNAs using a hierarchical extreme learning machine (H-ELM) algorithm with feature selection, Mol Genet Genomics, 293, 137, 10.1007/s00438-017-1372-7 Chen, 2017, vol. 5, 26582 Ma, 2015, Sequence-based Prediction of RNA-Binding Proteins using Random Forest with Minimum Redundancy Maximum Relevance Feature selection, Biomed Res Int, 2015, 425810, 10.1155/2015/425810 Chen, 2016, Identification of compound–protein interactions through the analysis of gene ontology, KEGG enrichment for proteins and molecular fragments of compounds, Mol Genet Genomics, 291, 2065, 10.1007/s00438-016-1240-x Chen, 2018, Gene expression differences among different MSI statuses in colorectal cancer, Int J Cancer, 143, 1731, 10.1002/ijc.31554 Li, 2019, Identification of synthetic lethality based on a functional network by using machine learning algorithms, J Cell Biochem, 120, 405, 10.1002/jcb.27395 Zhao, 2018, A similarity-based method for prediction of drug side effects with heterogeneous information, Math Biosci, 306, 136, 10.1016/j.mbs.2018.09.010 Chen, 2018, Tissue Expression Difference between mRNAs and lncRNAs, Int J Mol Sci, 19, 3416, 10.3390/ijms19113416 Kohavi, 1995, A study of cross-validation and bootstrap for accuracy estimation and model selection Pan, 2018, Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks, BMC Genomics, 19, 10.1186/s12864-018-4889-1 Abadi, 2016, TensorFlow: A system for large-scale machine learning, 265 Chen, 2017, Identification of drug-drug interactions using chemical interactions, Curr Bioinforma, 12, 526, 10.2174/1574893611666160618094219 Chen, 2012, Predicting Anatomical Therapeutic Chemical (ATC) classification of drugs by integrating chemical-chemical interactions and similarities, PLoS One, 7, e35254, 10.1371/journal.pone.0035254 Guo, 2018, A network integration method for deciphering the types of metabolic pathway of chemicals with heterogeneous information, Comb Chem High Throughput Screen Pan, 2018, Identifying patients with atrioventricular septal defect in down syndrome populations by using self-normalizing neural networks and feature selection, Gen, 9, 208 Wang, 2018, Identification of differentially expressed genes between original breast cancer and xenograft using machine learning algorithms, Gen, 9, 155 Chen, 2018, Identification of gene expression signatures across different types of neural stem cells with the Monte-Carlo feature selection method, J Cell Biochem, 119, 3394, 10.1002/jcb.26507 Matthews, 1975, Comparison of the predicted and observed secondary structure of T4 phage lysozyme, Biochimica et Biophysica Acta (BBA)-Protein Struct, 405, 442, 10.1016/0005-2795(75)90109-9 Chen, 2018, Prediction of nitrated tyrosine residues in protein sequences by extreme learning machine and feature selection methods, Comb Chem High Throughput Screen, 21, 393, 10.2174/1386207321666180531091619 Youden, 1950, Index for rating diagnostic tests, Cancer, 3, 32, 10.1002/1097-0142(1950)3:1<32::AID-CNCR2820030106>3.0.CO;2-3 Lee, 2017, Drug repositioning for enzyme modulator based on human metabolite-likeness, BMC Bioinforma, 18, 226, 10.1186/s12859-017-1637-5 Wang, 2018, Identification and analysis of the cleavage site in a signal peptide using SMOTE, dagging, and feature selection methods, Mol Omics, 14, 64, 10.1039/C7MO00030H Khan, 2018, RAFP-Pred: robust prediction of antifreeze proteins using localized analysis of n-peptide compositions, IEEE/ACM Trans Comput Biol Bioinform, 15, 244, 10.1109/TCBB.2016.2617337 Breiman, 2001, Random forests, Mach Learn, 45, 5, 10.1023/A:1010933404324 Pan, 2009, Robust prediction of B-factor profile from sequence using two-stage SVR based on random forest feature selection, Protein Pept Lett, 16, 1447, 10.2174/092986609789839250 Pan, 2010, Large-scale prediction of human protein-protein interactions from amino acid sequence based on latent topic features, J Proteome Res, 9, 4992, 10.1021/pr100618t Chen, 2015, Prediction and analysis of cell-penetrating peptides using pseudo-amino acid composition and random forest models, Amino Acids, 47, 1485, 10.1007/s00726-015-1974-5 Marques, 2016, Mirnacle: machine learning with SMOTE and random forest for improving selectivity in pre-miRNA ab initio prediction, BMC Bioinforma, 17, 474, 10.1186/s12859-016-1343-8 Zhang, 2017, Predicting citrullination sites in protein sequences using mRMR method and random forest algorithm, Comb Chem High Throughput Screen, 20, 164, 10.2174/1386207319666161227124350 Lippens, 2011, Keratinocyte-specific ablation of the NF-kappaB regulatory protein A20 (TNFAIP3) reveals a role in the control of epidermal homeostasis, Cell Death Differ, 18, 1845, 10.1038/cdd.2011.55 Meitzen, 2013, Enhanced striatal beta1-adrenergic receptor expression following hormone loss in adulthood is programmed by both early sexual differentiation and puberty: a study of humans and rats, Endocrinology, 154, 1820, 10.1210/en.2012-2131 Safi, 2014, Differential expression and role of hyperglycemia induced oxidative stress in epigenetic regulation of beta1, beta2 and beta3-adrenergic receptors in retinal endothelial cells, BMC Med Genomics, 7, 29, 10.1186/1755-8794-7-29 Trueb, 2013, Role of FGFRL1 and other FGF signaling proteins in early kidney development, Cell Mol Life Sci, 70, 2505, 10.1007/s00018-012-1189-9 Weinkauf, 2012, Local gene expression changes after UV-irradiation of human skin, PLoS One, 7, e39411, 10.1371/journal.pone.0039411 Wickramasinghe, 2015, Control of mammalian gene expression by selective mRNA export, Nat Rev Mol Cell Biol, 16, 431, 10.1038/nrm4010 Arib, 2011, Multiple facets of nuclear periphery in gene expression control, Curr Opin Cell Biol, 23, 346, 10.1016/j.ceb.2010.12.005 Reichenstein, 2008, Involvement of pituitary adenylate cyclase activating polypeptide (PACAP) and its receptors in the mechanism of antidepressant action, J Mol Neurosci, 36, 330, 10.1007/s12031-008-9116-0 Chik, 1997, Alpha 1D L-type Ca(2+)-channel currents: inhibition by a beta-adrenergic agonist and pituitary adenylate cyclase-activating polypeptide (PACAP) in rat pinealocytes, J Neurochem, 68, 1078, 10.1046/j.1471-4159.1997.68031078.x Joseph-Strauss, 2007, Spore germination in Saccharomyces cerevisiae: global gene expression patterns and cell cycle landmarks, Genome Biol, 8, R241, 10.1186/gb-2007-8-11-r241 Dunn, 1998, Regulation of a hair follicle keratin intermediate filament gene promoter, J Cell Sci, 3487, 10.1242/jcs.111.23.3487 Chen, 2017, Increased neuronal differentiation of neural progenitor cells derived from phosphovimentin-deficient mice, Mol Neurobiol, 55, 5478, 10.1007/s12035-017-0759-0 Jia, 2017, Dysregulation of CUL4A and CUL4B ubiquitin ligases in lung cancer, J Biol Chem, 292, 2966, 10.1074/jbc.M116.765230 Qi, 2015, Dysregulation of ubiquitin ligases in cancer, Drug Resist Updat, 23, 1, 10.1016/j.drup.2015.09.001 Barbi, 2015, Ubiquitin-dependent regulation of Foxp3 and Treg function, Immunol Rev, 266, 27, 10.1111/imr.12312 Bali, 2013, Genome-wide identification and functional analyses of microRNA signatures associated with cancer pain, EMBO Mol Med, 5, 1740, 10.1002/emmm.201302797 Pusztai, 2014, Gene signature-guided dasatinib therapy in metastatic breast cancer, Clin Cancer Res, 20, 5265, 10.1158/1078-0432.CCR-14-0800 Falk, 2014, Pain and nociception: mechanisms of cancer-induced bone pain, J Clin Oncol, 32, 1647, 10.1200/JCO.2013.51.7219 Di Giammartino, 2014, RBBP6 isoforms regulate the human polyadenylation machinery and modulate expression of mRNAs with AU-rich 3' UTRs, Genes Dev, 28, 2248, 10.1101/gad.245787.114 Luo, 2014, Post-transcriptionalup-regulation of PDGF-C by HuR in advanced and stressed breast cancer, Int J Mol Sci, 15, 20306, 10.3390/ijms151120306 Zhang, 2015, The superoxide dismutase 1 3'UTR maintains high expression of the SOD1 gene in cancer cells: The involvement of the RNA-binding protein AUF-1, Free Radic Biol Med, Vol. 85, 33, 10.1016/j.freeradbiomed.2015.04.012 Sobolewski, 2015, Histone deacetylase inhibitors activate tristetraprolin expression through induction of early growth response protein 1 (EGR1) in colorectal cancer cells, Biomolecules, 5, 2035, 10.3390/biom5032035 Awazu, 2017, BMP7 dose-dependently stimulates proliferation and cadherin-11 expression via ERK and p38 in a murine metanephric mesenchymal cell line, Physiol Rep, 5, 10.14814/phy2.13378 Couillard, 2009, C-myc as a modulator of renal stem/progenitor cell population, Dev Dyn, 238, 405, 10.1002/dvdy.21841 Drummond, 1998, Expression of fetal kidney growth factors in a kidney tumor line: role of FGF2 in kidney development, Exp Nephrol, 6, 522, 10.1159/000020567 Ma, 2017, Estrogen receptor beta as a therapeutic target in breast cancer stem cells, J Natl Cancer Inst, 109, 1, 10.1093/jnci/djw236 Haldosen, 2014, Estrogen receptor beta in breast cancer, Mol Cell Endocrinol, 382, 665, 10.1016/j.mce.2013.08.005 Gevorgyan, 2016, HER2-positive neuroendocrine breast cancer: case report and review of literature, Breast Care (Basel), 11, 424, 10.1159/000453572 Meehan, 2017, HER2 mRNA transcript quantitation in breast cancer, Clin Transl Oncol, 19, 606, 10.1007/s12094-016-1573-2 Wang, 2010, AFP computational secreted network construction and analysis between human hepatocellular carcinoma (HCC) and no-tumor hepatitis/cirrhotic liver tissues, Tumour Biol, 31, 417, 10.1007/s13277-010-0050-8 Gelis, 2017, Functional expression of olfactory receptors in human primary melanoma and melanoma metastasis, Exp Dermatol, 26, 569, 10.1111/exd.13316 Jin, 2017, Crosstalk in competing endogenous RNA network reveals the complex molecular mechanism underlying lung cancer, Oncotarget, 8, 91270, 10.18632/oncotarget.20441 Ranzani, 2017, Revisiting olfactory receptors as putative drivers of cancer, Wellcome Open Res, 2, 9, 10.12688/wellcomeopenres.10646.1