Time to reality check the promises of machine learning-powered precision medicine

The Lancet Digital Health - Tập 2 - Trang e677-e680 - 2020
Jack Wilkinson1, Kellyn F Arnold2,3, Eleanor J Murray4, Maarten van Smeden5, Kareem Carr6, Rachel Sippy7,8,9, Marc de Kamps2,10, Andrew Beam4, Stefan Konigorski11,12, Christoph Lippert11,12, Mark S Gilthorpe2,3,13, Peter W G Tennant2,3,13
1Centre for Biostatistics, Manchester Academic Health Science Centre, Division of Population Health, Health Services Research and Primary Care, University of Manchester, Manchester, UK
2Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
3Faculty of Medicine and Health, University of Leeds, Leeds, UK
4Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
5Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands
6Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, MA, USA
7Institute for Global Health and Translational Science, SUNY Upstate Medical University, Syracuse, NY, USA
8Department of Geography, University of Florida, Gainesville, FL, USA
9Emerging Pathogens Institute, University of Florida, Gainesville, FL, USA
10School of Computing, University of Leeds, Leeds, UK
11Digital Health & Machine Learning Research Group, Hasso Plattner Institut for Digital Engineering, Potsdam, Germany
12Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
13Alan Turing Institute, London, UK

Tài liệu tham khảo

Macklon, 2019, Building an evidence base for IVF ‘add-ons’, Reprod Biomed Online, 38, 853, 10.1016/j.rbmo.2019.04.005 Matheny, 2019, Artificial intelligence in health care: a report from the National Academy of Medicine, JAMA, 323, 509, 10.1001/jama.2019.21579 Gombar, 2019, It is time to learn from patients like mine, NPJ Digit Med, 2, 16, 10.1038/s41746-019-0091-3 2019, Next generation public health: towards precision and fairness, Lancet Public Health, 4, e209, 10.1016/S2468-2667(19)30064-7 Bien, 2018, Deep-learning-assisted diagnosis for knee magnetic resonance imaging: development and retrospective validation of MRNet, PLoS Med, 15, 10.1371/journal.pmed.1002699 Hannun, 2019, Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network, Nat Med, 25, 65, 10.1038/s41591-018-0268-3 Rajpurkar, 2018, Deep learning for chest radiograph diagnosis: a retrospective comparison of the CheXNeXt algorithm to practicing radiologists, PLoS Med, 15, 10.1371/journal.pmed.1002686 Vollmer, 2020, Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness, BMJ, 368 Davis Gallagher Liu, 2019, A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis, Lancet Digital Health, 1, e271, 10.1016/S2589-7500(19)30123-2 van Smeden, 2018, Machine learning compared with pathologist assessment, JAMA, 319, 1725, 10.1001/jama.2018.1466 Nagendran, 2020, Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies, BMJ, 368, m689, 10.1136/bmj.m689 Volkmann, 2019, A plea for taking all available clinical information into account when assessing the predictive value of omics data, BMC Med Res Methodol, 19, 162, 10.1186/s12874-019-0802-0 Abràmoff, 2018, Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices, NPJ Digit Med, 1, 39, 10.1038/s41746-018-0040-6 Damen, 2016, Prediction models for cardiovascular disease risk in the general population: systematic review, BMJ, 353 Beede Zhang, 2019, Learning for personalized medicine: a comprehensive review from a deep learning perspective, IEEE Rev Biomed Eng, 12, 194, 10.1109/RBME.2018.2864254 Hernán, 2020 Peters, 2017 Greenland, 1999, Causal diagrams for epidemiologic research, Epidemiology, 10, 37, 10.1097/00001648-199901000-00008 Angrist, 2009 Rothman, 2013, Why representativeness should be avoided, Int J Epidemiol, 42, 1012, 10.1093/ije/dys223 Shmueli, 2010, To explain or to predict?, Statist Sci, 25, 289, 10.1214/10-STS330 Hernán, 2019, A second chance to get causal inference right: a classification of data science tasks, Chance, 32, 42, 10.1080/09332480.2019.1579578 Arnold, 2020, Reflections on modern methods: generalized linear models for prognosis and intervention—theory, practice and implications for machine learning, Int J Epidemiol Senn, 2016, Mastering variation: variance components and personalised medicine, Stat Med, 35, 966, 10.1002/sim.6739 Pearson, 2006, Switching from insulin to oral sulfonylureas in patients with diabetes due to Kir6.2 mutations, N Engl J Med, 355, 467, 10.1056/NEJMoa061759 Sundström, 2019, The Precision HYpertenSIon Care (PHYSIC) study: a double-blind, randomized, repeated cross-over study, Ups J Med Sci, 124, 51, 10.1080/03009734.2018.1498958 Peto, 2011, Current misconception 3: that subgroup-specific trial mortality results often provide a good basis for individualising patient care, Br J Cancer, 104, 1057, 10.1038/bjc.2011.79 Kent, 2018, Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects, BMJ, 363 van Klaveren, 2019, Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting, J Clin Epidemiol, 114, 72, 10.1016/j.jclinepi.2019.05.029 Antoniou, 2017, Biomarker-guided non-adaptive trial designs in phase 2 and phase 3: a methodological review, J Pers Med, 7, 1, 10.3390/jpm7010001 Burke, 2015, Three simple rules to ensure reasonably credible subgroup analyses, BMJ, 351