Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants

PLoS ONE - Tập 14 Số 5 - Trang e0213653
Ahmed M. Alaa1, Thomas Bolton2,3, Emanuele Di Angelantonio2,3, James H.F. Rudd4, Mihaela van der Schaar5,1,6
1University of California, Los Angeles, Los Angeles, California, United States of America
2Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
3National Institute for Health Research (NIHR) Blood and Transplant Research Unit (BTRU) in Donor Health and Genomics, University of Cambridge, Cambridge, United Kingdom
4Department of Cardiovascular Medicine, University of Cambridge and Cambridge University Hospitals NHS Foundation Trust, Cambridge, United Kingdom
5Alan Turing Institute, London, United Kingdom
6University of Oxford, Oxford, United Kingdom

Tóm tắt

Từ khóa


Tài liệu tham khảo

MR Thomas, 2017, Novel risk markers and risk assessments for cardiovascular disease, Circulation research, 120, 133, 10.1161/CIRCRESAHA.116.309955

PM Ridker, 2008, Rosuvastatin to prevent vascular events in men and women with elevated C-reactive protein, New England Journal of Medicine, 359, 2195, 10.1056/NEJMoa0807646

HM Kremers, 2008, High ten-year risk of cardiovascular disease in newly diagnosed rheumatoid arthritis patients: A population-based cohort study, Arthritis & Rheumatology, 58, 2268, 10.1002/art.23650

RB D’Agostino, 2008, General cardiovascular risk profile for use in primary care: the Framingham Heart Study, Circulation, 117, 743, 10.1161/CIRCULATIONAHA.107.699579

R Conroy, 2003, Estimation of ten-year risk of fatal cardiovascular disease in Europe: the SCORE project, European heart journal, 24, 987, 10.1016/S0195-668X(03)00114-3

L Sjöström, 2004, Lifestyle, diabetes, and cardiovascular risk factors 10 years after bariatric surgery, New England Journal of Medicine, 351, 2683, 10.1056/NEJMoa035622

P Greenland, 2010, 2010 ACCF/AHA guideline for assessment of cardiovascular risk in asymptomatic adults: a report of the American College of Cardiology Foundation/American Heart Association task force on practice guidelines developed in collaboration with the American Society of Echocardiography, American Society of Nuclear Cardiology, Society of Atherosclerosis Imaging and Prevention, Society for Cardiovascular Angiography and Interventions, Society of Cardiovascular Computed Tomography, and Society for Cardiovascular Magnetic Resonance, Journal of the American College of Cardiology, 56, e50, 10.1016/j.jacc.2010.09.001

MF Piepoli, 2016, 2016 European Guidelines on cardiovascular disease prevention in clinical practice: The Sixth Joint Task Force of the European Society of Cardiology and Other Societies on Cardiovascular Disease Prevention in Clinical Practice (constituted by representatives of 10 societies and by invited experts) Developed with the special contribution of the European Association for Cardiovascular Prevention & Rehabilitation (EACPR), Atherosclerosis, 252, 207, 10.1016/j.atherosclerosis.2016.05.037

J Hippisley-Cox, 2008, Predicting cardiovascular risk in England and Wales: prospective derivation and validation of QRISK2, Bmj, 336, 1475, 10.1136/bmj.39609.449676.25

J Hippisley-Cox, 2017, Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study, bmj, 357, j2099, 10.1136/bmj.j2099

GC Siontis, 2012, Comparisons of established risk prediction models for cardiovascular disease: systematic review, Bmj, 344, e3318, 10.1136/bmj.e3318

RL Coleman, 2007, Framingham, SCORE, and DECODE risk equations do not provide reliable cardiovascular risk estimates in type 2 diabetes, Diabetes care, 30, 1292, 10.2337/dc06-1358

P McEwan, 2004, Evaluating the performance of the Framingham risk equations in a population with diabetes, Diabetic medicine, 21, 318, 10.1111/j.1464-5491.2004.01139.x

I Martín-Timón, 2014, Type 2 diabetes and cardiovascular disease: have all risk factors the same strength?, World journal of diabetes, 5, 444, 10.4239/wjd.v5.i4.444

JB Buse, 2007, Primary prevention of cardiovascular diseases in people with diabetes mellitus: a scientific statement from the American Heart Association and the American Diabetes Association, Circulation, 115, 114, 10.1161/CIRCULATIONAHA.106.179294

B Ambale-Venkatesh, 2017, Cardiovascular event prediction by machine learning: the Multi-Ethnic Study of Atherosclerosis, Circulation research

T Ahmad, 2018, Machine Learning Methods Improve Prognostication, Identify Clinically Distinct Phenotypes, and Detect Heterogeneity in Response to Therapy in a Large Cohort of Heart Failure Patients, Journal of the American Heart Association, 7, e008081, 10.1161/JAHA.117.008081

SF Weng, 2017, Can machine-learning improve cardiovascular risk prediction using routine clinical data?, PloS one, 12, e0174944, 10.1371/journal.pone.0174944

Alaa AM, van der Schaar M. AutoPrognosis: Automated Clinical Prognostic Modeling via Bayesian Optimization with Structured Kernel Learning. International Conference on Machine Learning. 2018;.

C Sudlow, 2015, UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age, PLoS medicine, 12, e1001779, 10.1371/journal.pmed.1001779

LJ Palmer, 2007, UK Biobank: bank on it, The Lancet, 369, 1980, 10.1016/S0140-6736(07)60924-6

A Ganna, 2015, 5 year mortality predictors in 498 103 UK Biobank participants: a prospective population-based study, The Lancet, 386, 533, 10.1016/S0140-6736(15)60175-1

L Adamska, 2015, Challenges of linking to routine healthcare records in UK Biobank, Trials, 16, O68, 10.1186/1745-6215-16-S2-O68

DC Goff, 2014, 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines, Journal of the American College of Cardiology, 63, 2935

DJ Stekhoven, 2011, MissForest—non-parametric missing value imputation for mixed-type data, Bioinformatics, 28, 112, 10.1093/bioinformatics/btr597

R Tibshirani, 1996, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society Series B (Methodological), 267, 10.1111/j.2517-6161.1996.tb02080.x

MA Hearst, 1998, Support vector machines, IEEE Intelligent Systems and their applications, 13, 18, 10.1109/5254.708428

L Breiman, 2001, Random forests, Machine learning, 45, 5, 10.1023/A:1010933404324

Y LeCun, 2015, Deep learning, nature, 521, 436, 10.1038/nature14539

G Rätsch, 2001, Soft margins for AdaBoost, Machine learning, 42, 287, 10.1023/A:1007618119488

JH Friedman, 2001, Greedy function approximation: a gradient boosting machine, Annals of statistics, 1189

F Pedregosa, 2011, Scikit-learn: Machine learning in Python, Journal of machine learning research, 12, 2825

Z Ghahramani, 2015, Probabilistic machine learning and artificial intelligence, Nature, 521, 452, 10.1038/nature14541

J Snoek, 2012, Practical bayesian optimization of machine learning algorithms, Advances in neural information processing systems, 2951

Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. ACM; 2016. p. 785–794.

C Strobl, 2008, Conditional variable importance for random forests, BMC bioinformatics, 9, 307, 10.1186/1471-2105-9-307

MA Allison, 2008, A high ankle-brachial index is associated with increased cardiovascular disease morbidity and lower quality of life, Journal of the American College of Cardiology, 51, 1292, 10.1016/j.jacc.2007.11.064

BW Hesse, 2005, Trust and sources of health information: the impact of the Internet and its implications for health care providers: findings from the first Health Information National Trends Survey, Archives of internal medicine, 165, 2618, 10.1001/archinte.165.22.2618

TA Gaziano, 2008, Laboratory-based versus non-laboratory-based method for assessment of cardiovascular disease risk: the NHANES I Follow-up Study cohort, The Lancet, 371, 923, 10.1016/S0140-6736(08)60418-3

S Mendis, 2007, World Health Organization (WHO) and International Society of Hypertension (ISH) risk prediction charts: assessment of cardiovascular risk for prevention and control of cardiovascular disease in low and middle-income countries, Journal of hypertension, 25, 1578, 10.1097/HJH.0b013e3282861fd3

G Assmann, 2002, Simple scoring scheme for calculating the risk of acute coronary events based on the 10-year follow-up of the prospective cardiovascular Münster (PROCAM) study, Circulation, 105, 310, 10.1161/hc0302.102575

K Eeg-Olofsson, 2010, New aspects of HbA1c as a risk factor for cardiovascular diseases in type 2 diabetes: an observational study from the Swedish National Diabetes Register (NDR), Journal of internal medicine, 268, 471, 10.1111/j.1365-2796.2010.02265.x

ERF Collaboration, 2012, C-reactive protein, fibrinogen, and cardiovascular disease prediction, New England Journal of Medicine, 367, 1310, 10.1056/NEJMoa1107477

P Willeit, 2016, Natriuretic peptides and integrated risk assessment for cardiovascular disease: an individual-participant-data meta-analysis, The Lancet Diabetes & Endocrinology, 4, 840, 10.1016/S2213-8587(16)30196-6