Patient clustering improves efficiency of federated machine learning to predict mortality and hospital stay time using distributed electronic medical records

Journal of Biomedical Informatics - Tập 99 - Trang 103291 - 2019
Huang Li1,2, Andrew Shea3, Huining Qian4, Aditya Masurkar5, Hao Deng6,7,8, Dianbo Liu6,3,9
1Academy of Arts and Design, Tsinghua University, Beijing, 10084, China
2The Future Laboratory, Tsinghua University, Beijing, 10084, China
3Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, United States
4College of Applied Mathematical and Physical Science, Beijing University of Technology, Beijing 100124, China
5School of Engineering, Northeastern University, Boston, MA 02115, United States
6Boston Children's Hospital, Boston, MA 02115, United States
7Department Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Boston 02115, United States
8School of Public Health, Johns Hopkins University, United States
9Medical School, Harvard University, Boston, MA 02115, United States

Tóm tắt

Từ khóa


Tài liệu tham khảo

Cebul, 2011, Electronic health records and quality of diabetes care, N. Engl. J. Med., 365, 825, 10.1056/NEJMsa1102519

Neumann, 2017, Evaluating quality of dental care among patients with diabetes: adaptation and testing of a dental quality measure in electronic health records, J. Am. Dent. Assoc., 148, 10.1016/j.adaj.2017.04.017

Dorr, 2007, Informatics systems to promote improved care for chronic illness: a literature review, J. Am. Med. Inform. Assoc., 14, 156, 10.1197/jamia.M2255

Bodenheimer, 2002, Improving primary care for patients with chronic illness, JAMA, 288, 1775, 10.1001/jama.288.14.1775

Bodenheimer, 2002, Improving primary care for patients with chronic illness: the chronic care model, part 2, JAMA, 288, 1909, 10.1001/jama.288.15.1909

Podichetty, 2004, The progressive roles of electronic medicine: benefits, concerns, and costs, Am. J. Med. Sci., 328, 94, 10.1097/00000441-200408000-00005

Simon, 2006, An examination of the financial feasibility of electronic medical records (EMRs): a case study of tangible and intangible benefits, Int. J. Electron. Healthc., 2, 185, 10.1504/IJEH.2006.008832

Tierney, 2013, Medical education in the electronic medical record (EMR) era: benefits, challenges, and future directions, Acad. Med., 88, 748, 10.1097/ACM.0b013e3182905ceb

Mani, 2012, Type 2 diabetes risk forecasting from EMR data using machine learning, AMIA Ann. Symp. Proc., 2012, 606

Tran, 2015, Learning vector representation of medical objects via EMR-driven nonnegative restricted Boltzmann machines (eNRBM), J. Biomed. Inform., 54, 96, 10.1016/j.jbi.2015.01.012

Lee, 2017, Deep learning is effective for classifying normal versus age-related macular degeneration OCT images, Ophthalmol. Retina, 1, 322, 10.1016/j.oret.2016.12.009

Shameer, 2017, Predictive modeling of hospital readmission rates using electronic medical record-wide machine learning: a case-study using Mount Sinai heart failure cohort, Pac. Symp. Biocomput., 22, 276

Miotto, 2018, Deep learning for healthcare: review, opportunities and challenges, Brief. Bioinform., 19, 1236, 10.1093/bib/bbx044

Dubovitskaya, 2017, Secure and trustable electronic medical records sharing using blockchain, AMIA Ann. Symp. Proc., 2017, 650

Yang, 2015, A hybrid solution for privacy preserving medical data sharing in the cloud environment, Fut. Gener. Comput. Syst., 43, 74, 10.1016/j.future.2014.06.004

Wu, 2012, Secure sharing of electronic health records in clouds, 711

Perera, 2011, Views on health information sharing and privacy from primary care practices using electronic medical records, Int. J. Med. Inform., 80, 94, 10.1016/j.ijmedinf.2010.11.005

Trinidad, 2010, Genomic research and wide data sharing: views of prospective participants, Genet. Med., 12, 486, 10.1097/GIM.0b013e3181e38f9e

Basavegowda, 2013, Electronic medical report security using visual secret sharing scheme, 78

Middleton, 1998, Use of the WWW for distributed knowledge engineering for an EMR: the KnowledgeBank concept, Proc. AMIA Symp., 126

Russell, 2004, Barriers to EMR adoption in internal medicine and pediatric outpatient practices, Tennessee Med.: J. Tennessee Med. Assoc., 97, 457

Caine, 2013, Patients want granular privacy control over health information in electronic medical records, J. Am. Med. Inform. Assoc., 20, 7, 10.1136/amiajnl-2012-001023

Loukides, 2010, The disclosure of diagnosis codes can breach research participants' privacy, J. Am. Med. Inform. Assoc., 17, 322, 10.1136/jamia.2009.002725

Rantz, 2010, Cost, staffing and quality impact of bedside electronic medical record (EMR) in nursing homes, J. Am. Med. Dir. Assoc., 11, 485, 10.1016/j.jamda.2009.11.010

.B. McMahan, E. Moore, D. Ramage, S. Hampson, Communication-efficient learning of deep networks from decentralized data, arXiv preprint arXiv:1602.05629; 2016.

D. Liu, T. Miller, R. Sayeed, K. Mandl, FADL: Federated-Autonomous Deep Learning for Distributed Electronic Health Record, arXiv preprint arXiv:1811.11400; 2018.

Y. Zhao, M. Li, L. Lai, N. Suda, D. Civin, V. Chandra, Federated Learning with Non-IID Data, arXiv preprint arXiv:1806.00582; 2018.

E. Jeong, S. Oh, H. Kim, J. Park, M. Bennis, S.-L. Kim, Communication-Efficient On-Device Machine Learning: Federated Distillation and Augmentation under Non-IID Private Data, arXiv preprint arXiv:1811.11479; 2018.

L. Huang, Y. Yin, Z. Fu, S. Zhang, H. Deng, D. Liu, LoAdaBoost: Loss-Based AdaBoost Federated Machine Learning on medical Data, arXiv preprint arXiv:1811.12629; 2018.

Rosales, 2010, Guest editorial: special issue on impacting patient care by mining medical data, Data Mining Knowled. Discov., 20, 325, 10.1007/s10618-010-0167-9

Xie, 2016, Unsupervised deep embedding for clustering analysis, Int. Conf. Machine Learn., 478

Greenfield, 2002, Profiling care provided by different groups of physicians: effects of patient case-mix (bias) and physician-level clustering on quality assessment results, Ann. Intern. Med., 136, 111, 10.7326/0003-4819-136-2-200201150-00008

Walsh, 2006, Symptom clustering in advanced cancer, Support. Care Cancer, 14, 831, 10.1007/s00520-005-0899-z

Sanders, 1993, Empirically derived chronic pain patient subgroups: the utility of multidimensional clustering to identify differential treatment effects, Pain, 54, 51, 10.1016/0304-3959(93)90099-B

Pollard, 2018, The eICU collaborative research database, a freely available multi-center database for critical care research, Sci. Data, 5, 180178, 10.1038/sdata.2018.178

Polat, 2008, Computer aided diagnosis of ECG data on the least square support vector machine, Digital Signal Process., 18, 25, 10.1016/j.dsp.2007.05.006

Caruana, 2015, Intelligible models for healthcare: Predicting pneumonia risk and hospital 30-day readmission, 1721

Hashi, 2017, An expert clinical decision support system to predict disease using classification techniques, 396

Cogley, 2012, Analyzing patient records to establish if and when a patient suffered from a medical condition, 38

Wang, 2013, Action recognition with improved trajectories, 3551

Bansal, 2012, Coreference semantics from web features, 389

Tsymbal, 2006, Dynamic integration with random forests, 801

Godbole, 2004, Discriminative methods for multi-labeled classification, 22

Vincent, 2010, Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion, J. Mach. Learn. Res., 3371

Youden, 1950, Index Rating Diagnostic Tests, 3, 32

Berrar, 2011, Caveats and pitfalls of ROC analysis in clinical microarray research (and how to avoid them), Briefings Bioinf., 13, 83, 10.1093/bib/bbr008

Halligan, 2015, Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach, Eur. Radiol., 25, 932, 10.1007/s00330-014-3487-0