iCircDA-MF: identification of circRNA-disease associations based on matrix factorization

Briefings in Bioinformatics - Tập 21 Số 4 - Trang 1356-1367 - 2020
Hang Wei1, Bin Liu2,1
1School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China
2School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China

Tóm tắt

AbstractCircular RNAs (circRNAs) are a group of novel discovered non-coding RNAs with closed-loop structure, which play critical roles in various biological processes. Identifying associations between circRNAs and diseases is critical for exploring the complex disease mechanism and facilitating disease-targeted therapy. Although several computational predictors have been proposed, their performance is still limited. In this study, a novel computational method called iCircDA-MF is proposed. Because the circRNA-disease associations with experimental validation are very limited, the potential circRNA-disease associations are calculated based on the circRNA similarity and disease similarity extracted from the disease semantic information and the known associations of circRNA-gene, gene-disease and circRNA-disease. The circRNA-disease interaction profiles are then updated by the neighbour interaction profiles so as to correct the false negative associations. Finally, the matrix factorization is performed on the updated circRNA-disease interaction profiles to predict the circRNA-disease associations. The experimental results on a widely used benchmark dataset showed that iCircDA-MF outperforms other state-of-the-art predictors and can identify new circRNA-disease associations effectively.

Từ khóa


Tài liệu tham khảo

Meng, 2017, CircRNA: functions and properties of a novel potential biomarker for cancer, Mol Cancer, 16, 94, 10.1186/s12943-017-0663-2

Wang, 2013, Computational approaches in detecting non-coding RNA, Curr Genomics, 14, 371, 10.2174/13892029113149990005

Wang, 2014, Briefing in family characteristics of microRNAs and their applications in cancer research, Biochim Biophys Acta Proteins and Proteom, 1844, 191, 10.1016/j.bbapap.2013.08.002

Bartel, 2009, MicroRNAs: target recognition and regulatory functions, Cell, 136, 215, 10.1016/j.cell.2009.01.002

Hansen, 2013, Natural RNA circles function as efficient microRNA sponges, Nature, 495, 384, 10.1038/nature11993

Li, 2015, Exon-intron circular RNAs regulate transcription in the nucleus, Nat Struct Mol Biol, 22, 256, 10.1038/nsmb.2959

Wang, 2017, Circular RNAs in human cancer, Mol Cancer, 16, 25, 10.1186/s12943-017-0598-7

Esteller, 2011, Non-coding RNAs in human disease, Nat Rev Genet, 12, 861, 10.1038/nrg3074

Peng, 2015, The emerging landscape of circular RNA ciRS-7 in cancer (review), Oncol Rep, 33, 2669, 10.3892/or.2015.3904

Yu, 2016, The circular RNA Cdr1as act as an oncogene in hepatocellular carcinoma through targeting miR-7 expression, Plos One, 11

Weng, 2017, Circular RNA ciRS-7—a promising prognostic biomarker and a potential therapeutic target in colorectal cancer, Clin Cancer Res, 23, 3918, 10.1158/1078-0432.CCR-16-2541

Tang, 2017, Silencing CDR1as inhibits colorectal cancer progression through regulating microRNA-7, Onco Targets Ther, 10, 2045, 10.2147/OTT.S131597

Floris, 2017, Regulatory role of circular RNAs and neurological disorders, Mol Neurobiol, 54, 5156, 10.1007/s12035-016-0055-4

Chen, 2013, LncRNADisease: a database for long-non-coding RNA-associated diseases, Nucleic Acids Res, 41, D983, 10.1093/nar/gks1099

Li, 2014, HMDD v2.0: a database for experimentally supported human microRNA and disease associations, Nucleic Acids Res, 42, D1070, 10.1093/nar/gkt1023

Bao, 2019, LncRNADisease 2.0: an updated database of long non-coding RNA-associated diseases, Nucleic Acids Res, 47, D1034, 10.1093/nar/gky905

Huang, 2019, HMDD v3.0: a database for experimentally supported human microRNA-disease associations, Nucleic Acids Res, 47, D1013, 10.1093/nar/gky1010

Zou, 2016, Similarity computation strategies in the microRNA-disease network: a survey, Brief Funct Genomics, 15, 55

Chen, 2019, MicroRNAs and complex diseases: from experimental results to computational models, Brief Bioinform, 20, 515, 10.1093/bib/bbx130

Chen, 2017, Long non-coding RNAs and complex diseases: from experimental results to computational models, Brief Bioinform, 18, 558

Fan, 2018, CircR2Disease: a manually curated database for experimentally supported circular RNAs associated with various diseases, Database, 2018, bay044, 10.1093/database/bay044

Yao, 2018, Circ2Disease: a manually curated database of experimentally validated circRNAs in human disease, Sci Rep, 8, 11018, 10.1038/s41598-018-29360-3

Zhao, 2018, circRNA disease: a manually curated database of experimentally supported circRNA-disease associations, Cell Death Dis, 9, 475, 10.1038/s41419-018-0503-3

Lei, 2018, PWCDA: path weighted method for predicting circRNA-disease associations, Int J Mol Sci, 19, 3410, 10.3390/ijms19113410

Yan, 2018, DWNN-RLS: regularized least squares method for predicting circRNA-disease associations, BMC Bioinformatics, 19, 520, 10.1186/s12859-018-2522-6

Fan, 2018, Prediction of CircRNA-disease associations using KATZ model based on heterogeneous networks, Int J Biol Sci, 14, 1950, 10.7150/ijbs.28260

van Laarhoven, 2011, Gaussian interaction profile kernels for predicting drug-target interaction, Bioinformatics, 27, 3036, 10.1093/bioinformatics/btr500

You, 2017, PBMDA: a novel and effective path-based computational model for miRNA-disease association prediction, PLoS Comput Biol, 13, 10.1371/journal.pcbi.1005455

Luo, 2017, Predicting MicroRNA-disease associations using Kronecker regularized least squares based on heterogeneous omics data, IEEE Access, 5, 2503, 10.1109/ACCESS.2017.2672600

Chen, 2013, Novel human lncRNA-disease association inference based on lncRNA expression profiles, Bioinformatics, 29, 2617, 10.1093/bioinformatics/btt426

Wang, 2007, A new method to measure the semantic similarity of GO terms, Bioinformatics, 23, 1274, 10.1093/bioinformatics/btm087

Lan, 2017, LDAP: a web server for lncRNA-disease association prediction, Bioinformatics, 33, 458, 10.1093/bioinformatics/btw639

Wang, 2010, Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases, Bioinformatics, 26, 1644, 10.1093/bioinformatics/btq241

Chen, 2017, RKNNMDA: ranking-based KNN for MiRNA-disease association prediction, RNA Biol, 14, 952, 10.1080/15476286.2017.1312226

Kibbe, 2015, Disease Ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data, Nucleic Acids Res, 43, D1071, 10.1093/nar/gku1011

Chen, 2015, KATZLDA: KATZ measure for the lncRNA-disease association prediction, Sci Rep, 5, 16840, 10.1038/srep16840

Chen, 2018, MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction, PLoS Comput Biol, 14, 10.1371/journal.pcbi.1006418

Yan, DNRLMF-MDA:predicting microRNA-disease associations based on similarities of microRNAs and diseases, IEEE/ACM Trans Comput Biol Bioinform, 2017, 1

Chen, 2017, DRMDA: deep representations-based miRNA-disease association prediction, J Cell Mol Med, 22, 472, 10.1111/jcmm.13336

Zhao, 2018, A novel approach based on a weighted interactive network to predict associations of MiRNAs and diseases, Int J Mol Sci, 20, 110, 10.3390/ijms20010110

Zhong, 2018, Circular RNAs function as ceRNAs to regulate and control human cancer progression, Mol Cancer, 17, 79, 10.1186/s12943-018-0827-8

Huang, 2018, Constructing prediction models from expression profiles for large scale lncRNA-miRNA interaction profiling, Bioinformatics, 34, 812, 10.1093/bioinformatics/btx672

Liu, 2017, Inferring microRNA-disease associations by random walk on a heterogeneous network with multiple data sources, IEEE/ACM Trans Comput Biol Bioinform, 14, 905, 10.1109/TCBB.2016.2550432

Hwang, 2018, HumanNet v2: human gene networks for disease research, Nucleic Acids Res, 47, D573, 10.1093/nar/gky1126

Lee, 2011, Prioritizing candidate disease genes by network-based boosting of genome-wide association data, Genome Res, 21, 1109, 10.1101/gr.118992.110

Ezzat, 2017, Drug-target interaction prediction with graph regularized matrix factorization, IEEE/ACM Trans Comput Biol Bioinform, 14, 646, 10.1109/TCBB.2016.2530062

Lian, 2014, GeoMF: joint geographical modeling and matrix factorization for point-of-interest recommendation, Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, 831, 10.1145/2623330.2623638

He, 2016, Fast matrix factorization for online recommendation with implicit feedback, Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 549, 10.1145/2911451.2911489

Luo, 2014, An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems, IEEE Trans Industr Inform, 10, 1273, 10.1109/TII.2014.2308433

Wang, 2017, Improved anticancer drug response prediction in cell lines using matrix factorization with similarity regularization, BMC Cancer, 17, 513, 10.1186/s12885-017-3500-5

Liu, 2016, Neighborhood regularized logistic matrix factorization for drug-target interaction prediction, PLoS Comput Biol, 12, 10.1371/journal.pcbi.1004760

Li, 2017, MCMDA: matrix completion for MiRNA-disease association prediction, Oncotarget, 8, 21187, 10.18632/oncotarget.15061

Zhong, 2018, A non-negative matrix factorization based method for predicting disease-associated miRNAs in miRNA-disease bilayer network, Bioinformatics, 34, 267, 10.1093/bioinformatics/btx546

Xiao, 2018, A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations, Bioinformatics, 34, 239, 10.1093/bioinformatics/btx545

Lu, 2018, Prediction of lncRNA-disease associations based on inductive matrix completion, Bioinformatics, 34, 3357, 10.1093/bioinformatics/bty327

Fu, 2018, Matrix factorization-based data fusion for the prediction of lncRNA-disease associations, Bioinformatics, 34, 1529, 10.1093/bioinformatics/btx794

Lee, 1999, Learning the parts of objects by non-negative matrix factorization, Nature, 401, 788, 10.1038/44565

Lee, 2000, Advances in Neural Information Processing Systems Proceedings, 556

Golub, 1996, Matrix Computations, 3rd edn, 694

Liu, 2014, Progressive image denoising through hybrid graph Laplacian regularization: a unified framework, IEEE Trans Image Process, 23, 1491, 10.1109/TIP.2014.2303638

Cai, 2011, Graph regularized nonnegative matrix factorization for data representation, IEEE Trans Pattern Anal Mach Intell, 33, 1548, 10.1109/TPAMI.2010.231

Facchinei, 2014, Solving quasi-variational inequalities via their KKT conditions, Math Program, 144, 369, 10.1007/s10107-013-0637-0

Liu, 2018, iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC, Bioinformatics, 34, 33, 10.1093/bioinformatics/btx579

Hanley, 1982, The meaning and use of the area under a receiver operating characteristic (roc) curve, Radiology, 143, 29, 10.1148/radiology.143.1.7063747

Fawcett, 2006, An introduction to ROC analysis, Pattern Recognit Lett, 27, 861, 10.1016/j.patrec.2005.10.010

Liu, 2018, HITS-PR-HHblits: protein remote homology detection by combining PageRank and Hyperlink-Induced Topic Search, Brief Bioinform, 10.1093/bib/bby104

Tian, 2017, Circular RNA circHIPK3 promotes NCI-H1299 and NCI-H2170 cell proliferation through miR-379 and its target IGF1, Zhongguo Fei Ai Za Zhi, 20, 459

Chen, 2017, CircRNA_100782 regulates pancreatic carcinoma proliferation through the IL6-STAT3 pathway, Onco Targets Ther, 10, 5783, 10.2147/OTT.S150678

Yao, 2017, Over-expression of CircRNA_100876 in non-small cell lung cancer and its prognostic value, Pathol Res Pract, 213, 453, 10.1016/j.prp.2017.02.011

Sui, 2018, The differentially expressed circular ribonucleic acids of primary hepatic carcinoma following liver transplantation as new diagnostic biomarkers for primary hepatic carcinoma, Tumour Biol, 40, 1, 10.1177/1010428318766928

Li, 2016, Circular RNA expression profile of pancreatic ductal adenocarcinoma revealed by microarray, Cell Physiol Biochem, 40, 1334, 10.1159/000453186

Liu, 2019, A comprehensive review and comparison of existing computational methods for intrinsically disordered protein and region prediction, Brief Bioinform, 20, 330, 10.1093/bib/bbx126

Cheng, 2014, SemFunSim: a new method for measuring disease similarity by integrating semantic and gene functional association, PLoS One, 9

Ni, 2018, Constructing disease similarity networks based on disease module theory, IEEE/ACM Trans Comput Biol Bioinform, 10.1109/TCBB.2018.2817624

Feng, 2015, A locally weighted sparse graph regularized Non-Negative Matrix Factorization method, Neurocomputing, 169, 68, 10.1016/j.neucom.2014.11.091