Missing covariate data in medical research: To impute is better than to ignore

Journal of Clinical Epidemiology - Tập 63 Số 7 - Trang 721-727 - 2010
Kristel J.M. Janssen1, A. Rogier T. Donders2, Frank E. Harrell3, Yvonne Vergouwe4, Qingxia Chen3, Eric J.G. Sijbrands4, Karel G.M. Moons4
1Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands. [email protected]
2Department of Epidemiology, Biostatistics and Health Technology Assessment, Radboud University Nijmegen Medical Center, Nijmegen, the Netherlands
3Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN, USA
4Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands

Tóm tắt

Từ khóa


Tài liệu tham khảo

Burton, 2004, Missing covariate data within cancer prognostic studies: a review of current reporting and proposed guidelines, Br J Cancer, 91, 4, 10.1038/sj.bjc.6601907

Little, 1992, Regression with missing X's: a review, J Am Stat Assoc, 87, 1227, 10.2307/2290664

Little, 1987

Little, 1999, Methods for handling missing values in clinical trials, J Rheumatol, 26, 1654

Rubin, 1987

Rubin, 1996, Multiple imputation after 18+ years, J Am Stat Assoc, 91, 473, 10.2307/2291635

Schafer, 1997

Schafer, 2002, Missing data: our view of the state of the art, Psychol Methods, 7, 147, 10.1037/1082-989X.7.2.147

Barnes, 2006, Multiple imputation techniques in small sample clinical trials, Stat Med, 25, 233, 10.1002/sim.2231

Clark, 2003, Developing a prognostic model in the presence of missing data: an ovarian cancer case study, J Clin Epidemiol, 56, 28, 10.1016/S0895-4356(02)00539-5

Donders, 2006, Review: a gentle introduction to imputation of missing values, J Clin Epidemiol, 59, 1087, 10.1016/j.jclinepi.2006.01.014

Faris, 2002, Multiple imputation versus data enhancement for dealing with missing data in observational health care outcome analyses, J Clin Epidemiol, 55, 184, 10.1016/S0895-4356(01)00433-4

Harrell, 2001

Kmetic, 2002, Multiple imputation to account for missing data in a survey: estimating the prevalence of osteoporosis, Epidemiology, 13, 437, 10.1097/00001648-200207000-00012

Moons, 2006, Using the outcome for imputation of missing predictor values was preferred, J Clin Epidemiol, 59, 1092, 10.1016/j.jclinepi.2006.01.009

Janssen, 2009, Dealing with missing predictor values when applying clinical prediction models, Clin Chem, 55, 994, 10.1373/clinchem.2008.115345

Klebanoff, 2008, Use of multiple imputation in the epidemiologic literature, Am J Epidemiol, 168, 355, 10.1093/aje/kwn071

Oudega, 2005, Ruling out deep venous thrombosis in primary care. A simple diagnostic algorithm including D-dimer testing, Thromb Haemost, 94, 200, 10.1160/TH04-12-0829

Oudega, 2005, Limited value of patient history and physical examination in diagnosing deep vein thrombosis in primary care, Fam Pract, 22, 86, 10.1093/fampra/cmh718

Oudega, 2005, The Wells rule does not adequately rule out deep venous thrombosis in primary care patients, Ann Intern Med, 143, 100, 10.7326/0003-4819-143-2-200507190-00008

van Buuren

Hanley, 1983, A method of comparing the areas under receiver operating characteristic curves derived from the same cases, Radiology, 148, 839, 10.1148/radiology.148.3.6878708

Harrell, 1984, Regression modelling strategies for improved prognostic prediction, Stat Med, 3, 143, 10.1002/sim.4780030207

Collins, 2001, A comparison of inclusive and restrictive strategies in modern missing data procedures, Psychol Methods, 6, 330, 10.1037/1082-989X.6.4.330

Rubin, 1991, Multiple imputation in health-care databases: an overview and some applications, Stat Med, 10, 585, 10.1002/sim.4780100410

Schafer, 1999, Multiple imputation: a primer, Stat Methods Med Res, 8, 3, 10.1191/096228099671525676

Horton, 2001, Multiple imputation in practice: comparison of software packages for regression models with missing variables, Am Stat, 55, 244, 10.1198/000313001317098266

Horton, 2007, Much ado about nothing: a comparison of missing data methods and software to fit incomplete data regression models, Am Stat, 61, 79, 10.1198/000313007X172556