Topic modelling in precision medicine with its applications in personalized diabetes management

Expert Systems - Tập 39 Số 4 - 2022
Chong Ni Ki1, Amin Hosseinian‐Far2, Alireza Daneshkhah3, Nader Salari4
1School of Computing, Electronics and Mathematics, Coventry University, Coventry, UK
2Centre for Sustainable Business Practices, University of Northampton, Northampton, UK
3Research Centre for Computational Science and Mathematical Modelling, Coventry University, Coventry, UK
4Department of Biostatistics, Kermanshah University of Medical Sciences, Kermanshah, Iran

Tóm tắt

AbstractAdvances in Internet of Things (IoT) and analytic‐based systems in the past decade have found several applications in medical informatics, and have significantly facilitated healthcare decision making. Patients' data are collected through a variety of means, including IoT sensory systems, and require efficient, and accurate processing. Topic Modelling is an unsupervised machine learning algorithm for Natural Language Processing (NLP) that identifies relationships and associations within textual data. The application of Topic Modelling has been widely used on raw text data, where meaningful clusters (topics) are generated by the model. The purpose of this paper is to explore the varying methods of Topic Modelling, mostly the Latent Dirichlet allocation (LDA) model, and its applicability on personalized diabetes management. The proposed study evaluates the possibility of applying topic modelling methods on diabetes literature and genomic data in order to achieve precision medicine.

Từ khóa


Tài liệu tham khảo

10.1007/978-1-4614-3223-4_4

Aletras N., 2013, Proceedings of the 10th International Conference on Computational Semantics (IWCS 2013)–Long Papers, 13

10.4239/wjd.v4.i4.114

10.2337/db15-0822

10.14745/ccdr.v46i06a02

10.1093/hmg/ddx429

10.2174/1874467212666190930144349

10.2337/dc16-0826

10.1016/j.artmed.2020.101913

10.1145/2133806.2133826

10.1136/bmjopen-2019-036453

10.1016/j.mayocp.2018.07.018

10.1109/JIOT.2015.2417684

Chang J., 2009, Advances in neural information processing systems, 288

10.1007/s11042-017-5186-8

10.1007/s10916-017-0861-x

Chang V., 2019, 4th International Conference on Complexity, Future Information Systems and Risk, 121

10.1016/j.jocs.2017.09.002

10.1142/S0219720020500237

10.1093/jamia/ocw136

10.3892/etm.2016.3920

10.1016/j.eswa.2013.01.014

Cvitanic T., 2016, International Conference on Case‐Based Reasoning

10.1007/978-3-030-50613-1_3

Dennis F. R..(2020).Improving healthcare services in the rural counties of the Palmetto state: A factor analysis study(PhD thesis). Keiser University.

Deshkar S., 2017, A review on IoT based m‐health systems for diabetes, International Journal of Computer Science and Telecommunications, 8, 13

Diabetes UK. (2020).Maturity onset diabetes of the young (Mody). Retrieved fromhttps://www.diabetes.org.uk/diabetes-the-basics/other-types-of-diabetes/mody

10.1371/journal.pone.0179488

10.1002/1098-1004(200011)16:5<377::AID-HUMU1>3.0.CO;2-2

10.1016/j.future.2017.04.036

10.1007/978-3-030-18732-3

10.1086/518517

10.2337/db18-1263

Gilmartin A. B. H., 2008, Gestational diabetes mellitus, Reviews in Obstetrics and Gynecology, 1, 129

10.2337/db08-0116

10.1016/j.obmed.2020.100183

10.1007/s00125-017-4226-2

10.1177/1932296819867686

10.1007/978-3-319-60137-3_2

10.1016/j.compeleceng.2017.04.010

10.33549/physiolres.934487

10.1109/IEMBS.2011.6091302

10.1155/2017/9731806

10.1016/j.jbi.2018.09.015

10.1109/TSMC.1985.6313426

Kherwa P., 2020, Topic modeling: A comprehensive review, EAI Endorsed Transactions on Scalable Information Systems, 7, 159623

10.1080/01638539809545028

10.14257/ijbsbt.2014.6.1.17

10.1200/JOP.18.00661

10.1016/j.jbi.2016.02.003

10.1007/s10791-010-9141-9

10.3389/fpubh.2019.00349

Mimno D., 2011, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 262

Navlani A. (2018).Latent semantic analysis using Python. Retrieved fromhttps://www.datacamp.com/community/tutorials/discovering-hidden-topics-python

Naylor R., 2018, Maturity‐onset diabetes of the young overview

Newman D., 2010, Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, 100

10.1111/j.2040-1124.2011.00100.x

Novartis.2016).Voluntary data sharing ‐ Novartis: Clinical trials. Retrieved fromhttps://www.novartisclinicaltrials.com/TrialConnectWeb/voluntarydatasharing.nov

10.4314/jasem.v24i6.13

10.25270/jcp.2018.05.00016

10.2337/db20-0772

10.1016/bs.pmbts.2016.09.001

Qiang J., 2019, Short text topic modeling techniques, applications, and performance: A survey, IEEE Transactions on Knowledge and Data Engineering, 14, 1

10.1109/ISCAIE.2017.8074971

Ramos J., 2003, Proceedings of the First Instructional Conference on Machine Learning, 133

10.2337/dci19-0049

Rehurek R.(2019).models.word2vec‐Word2vec embeddings. Gensim: Topic modelling for humans.

10.1111/dom.12727

Resnik P., 2008, Communication of clinically relevant information in electronic health records: A comparison between structured data and unrestricted physician language, Perspectives in Health Information Management, 2008, 1

10.1161/CIRCULATIONAHA.109.914820

10.1145/2684822.2685324

Salari N., 2020, The prevalence of sleep disturbances among physicians and nurses facing the COVID‐19 patients: A systematic review and meta‐analysis, Globalization and Health, 16, 1

10.1186/s12960-020-00544-1

10.1007/978-3-030-44289-7_6

10.2174/092986707780059643

10.1016/j.ipm.2021.102516

10.2337/diaspect.27.2.87

10.1109/ICC.2012.6364830

10.2196/15511

10.1371/journal.pone.0055272

Vepa A. Saleem A. Rakhshan K. Omar A. Dharmaraj D. Sami J. Parekh S. Ibrahim M. Raza M. Kapila P. Chakrabarti P. Sedighi T. Chatrabgoun O. &Daneshkhah A. (2021).Predicting mortality duration of treatment pulmonary embolism and required ceiling of ventilatory support for COVID‐19 inpatients: A Machine‐Learning Approach.medRxiv.

10.3390/ijerph18126228

10.3390/ijerph17061982

10.2147/DMSO.S230061

Xiao C., 2017, Proceedings of the Thirty‐First AAAI Conference on Artificial Intelligence

Xu G., 2018, Prevalence of diagnosed type 1 and type 2 diabetes among US adults in 2016 and 2017: Population based study, BMJ, 362, 1

10.1145/267954.267957

10.7150/ijbs.24612

10.1016/j.ins.2018.02.005

10.1016/j.diabet.2015.01.003

10.1371/journal.pone.0212112