A new method of kernel-smoothing estimation of the ROC curve

Springer Science and Business Media LLC - Tập 79 - Trang 603-634 - 2015
Michał Pulit1
1Wrocław University of Technology, Wrocław, Poland

Tóm tắt

The receiver operating characteristic (ROC) curve is a popular graphical tool for describing the accuracy of a diagnostic test. Based on the idea of estimating the ROC curve as a distribution function, we propose a new kernel smoothing estimator of the ROC curve which is invariant under nondecreasing data transformations. We prove that the estimator has better asymptotic mean squared error properties than some other estimators involving kernel smoothing and we present an easy method of bandwidth selection. By simulation studies, we show that for the limited sample sizes, our proposed estimator is competitive with some other nonparametric estimators of the ROC curve. We also give an example of applying the estimator to a real data set.

Tài liệu tham khảo

Altman N, Leger C (1995) Bandwidth selection for kernel distribution function estimation. J Stat Plan Inference 46:195–214 Bowman A, Hall P, Prvan T (1998) Bandwidth selection for the smoothing of distribution functions. Biometrika 85:799–808 Du P, Tang L (2009) Transformation-invariant and nonparametric monotone smooth estimation of ROC curves. Stat Med 28:349–359 Gu J, Ghosal S, Roy A (2008) Bayesian bootstrap estimation of ROC curve. Stat Med 27:5407–5420 Hall PG, Hyndmann RJ (2003) Improved methods for bandwidth selection when estimating ROC curves. Stat Probab Lett 64:181–189 Hsieh F, Turnbull BW (1996) Nonparametric and semiparametric estimation of the receiver operating characteristic curve. Ann Stat 24:25–40 Jokiel-Rokita A, Pulit M (2012) Nonparametric estimation of the ROC curve based on smoothed empirical distribution functions. Stat Comput 23:703–712 Krzanowski WJ, Hand DJ (2009) ROC curves for continuous data. Chapman and Hall/CRC, London Lloyd CJ (1998) Using smoothed receiver operating characteristic curves to summarize and compare diagnostic systems. J Am Stat Assoc 93:1356–1364 Lloyd CJ, Zhou Y (1999) Kernel estimators of the ROC curve are better than empirical. Stat Probab Lett 44:221–228 Peng L, Zhou XH (2004) Local linear smoothing of receiver operating characteristic (ROC) curves. J Stat Plan Inference 118:129–143 Pepe MS (2003) The statistical evaluation of medical tests for classification and prediction. Oxford University Press, Oxford Polansky AM, Baker ER (2000) Multistage plug-in bandwidth selection for kernel distribution function estimates. J Stat Comput Simul 65:63–80 Sarda P (1993) Smoothing parameter selection for smooth distribution functions. J Stat Plan Inference 35:65–75 Swets JA (1988) Measuring the accuracy of diagnostic systems. Science 240:1285–1293 Tang L, Du P, Wu C (2010) Compare diagnostic tests using transformation-invariant smoothed ROC curves. J Stat Plan Inference 140:3540–3551 Tonini G, Fratto ME, Imperatori M, Pantano F, Vincenzi B, Santini D (2011) Predictive factors of response to treatment in patients with metastatic renal cell carcinoma: new evidence. Expert Rev Anticancer Ther 11(6):921–930 Zhou XH, Harezlak J (2002) Comparison of bandwidth selection methods for kernel smoothing of ROC curves. Stat Med 21:2045–2055 Znidaric M (2009) Asymptotic expansion for inverse moments of binomial and Poisson distributions. Open Stat Probab J 1:710 Zou KH, Hall WJ, Shapiro DE (1997) Smooth non-parametric receiver operating characteristic (ROC) curves for continuous diagnostic tests. Stat Med 16:2143–2156