Variable selection in proportional odds model with informatively interval-censored data
Statistische Hefte - Trang 1-28 - 2023
Tóm tắt
The proportional odds (PO) model is one of the most commonly used models for regression analysis of failure time data in survival analysis. It assumes that the odds of the failure is proportional to the baseline odds at any point in time given the covariate. The model focus on the situation that the ratio of the hazards converges to unity as time goes to infinity, while the proportional hazards (PH) model has a constant ratio of hazards over time. In the paper, we consider a general type of failure time data, case K interval-censored data, that include case I or case II interval-censored data as special cases. We propose a PO model-based unified penalized variable selection procedure that involves minimizing a negative sieve log-likelihood function plus a broken adaptive ridge penalty, with the initial values obtained from the ridge regression estimator. The proposed approach allows dependent censoring, which occurs quite often and could lead to biased or misleading estimates without considering it. We show that the proposed selection method has oracle properties and the estimator is semiparametrically efficient. The numerical studies suggest that the proposed approach works well for practical situations. In addition, the method is applied to an ADNI study that motivates this investigation.
Tài liệu tham khảo
Cai J, Fan J, Li R, Zhou H (2005) Variable selection for multivariate failure time data. Biometrika 92:303–316
Cook R, Lawless J (2007) The statistical analysis of recurrent events. Springer, New York
Cox DR (1972) Regression models and life-tables. J R Stat Soc B 34:187–202
Dai L, Chen K, Sun Z, Liu Z, Li G (2018) Broken adaptive ridge regression and its asymptotic properties. J Multivar Anal 168:334–351
Dicker L, Huang B, Lin X (2013) Variable selection and estimation with seamless-\(L_0\) penalty. Stat Sin 23:929–962
Du M, Sun J (2022) Variable selection for interval-censored failure time data. Int Stat Rev 90:193–215
Du M, Zhao H, Sun J (2021) A unified approach to variable selection for Coxs proportional hazards model with interval-censored failure time data. Stat Methods Med Res 30:1833–1849
Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle property. J Am Stat Assoc 96:1348–1360
Fan J, Li R (2002) Variable selection for Coxs proportional hazards model and frailty model. Ann Stat 30:74–99
Hu T, Zhou Q, Sun J (2017) Regression analysis of bivariate current status data under the proportional hazards model. Can J Stat 45:410–424
Huang J (1996) Effcient estimation for the proportional hazards model with interval censoring. Ann Stat 24:540–568
Huang J, Rossini A (1997) Sieve estimation for the proportional-odds failure-time regression model with interval censoring. J Am Stat Assoc 92:960–967
Huang C, Wang M (2004) Joint modeling and estimation for recurrent event processes and failure time data. J Am Stat Assoc 99:1153–1165
Kalbfleisch J, Prentice R (2002) The statistical analysis of failure time data, 2nd edn. Wiley, New York
Klein J, Moeschberger M (2003) Survival analysis: techniques for censored and truncated data, 2nd edn. Springer, New York
Li K, Chan W, Doody RS, Quinn J, Luo S, Initiative ADN (2017) Prediction of conversion to Alzheimers disease with longitudinal measures and time-to-event data. J Alzheimers Dis 58:361–371
Li S, Wu Q, Sun J (2020) Penalized estimation of semiparametric transformation models with interval-censored data and application to Alzheimers disease. Stat Methods Med Res 29:2151–2166
Lu W, Zhang HH (2007) Variable selection for proportional odds model. Stat Med 26:3771–3781
Lv J, Fan Y (2009) A unified approach to model selection and sparse recovery using regularized least squares. Ann Stat 37:3498–3528
Ma L, Hu T, Sun J (2015) Sieve maximum likelihood regression analysis of dependent current status data. Biometrika 102:731–738
Murphy SA, Rossini AJ, van der Vaart AW (1997) Maximum likelihood estimation in the proportional odds model. J Am Stat Assoc 92:968–976
Rossini AJ, Tsiatis AA (1996) A semiparametric proportional odds regression model for the analysis of current status data. J Am Stat Assoc 91:713–721
Scolas S, El Ghouch A, Legrand C, Oulhaj A (2016) Variable selection in a flexible parametric mixture cure model with interval-censored data. Stat Med 35:1210–1225
Shen X (1998) Propotional odds regression and sieve maximum likelihood estimation. Biometrika 85:165–177
Sun J (2006) The statistical analysis of interval-censored failure time data. Springer, New York
Sun L, Li S, Wang L, Song X, Sui X (2022a) Simultaneous variable selection in regression analysis of multivariate interval-censored data. Biometrics 78:1402–1413
Sun Z, Liu Y, Chen K, Li G (2022b) Broken adaptive ridge regression for right-censored survival data. Ann Inst Stat Math 74:69–91
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc B 58:267–288
Tibshirani R (1997) The Lasso method for variable selection in the Cox model. Stat Med 16:385–395
Wang J, Ghosh SK (2012) Shape restricted nonparametric regression with Bernstein polynomials. Comput Stat Data Anal 56:2729–274
Wang H, Leng C (2007) Unified LASSO estimation by least squares approximation. J Am Stat Assoc 102:1039–1048
Wang L, Wang L (2021) Regression analysis of arbitrarily censored survival data under the proportional odds model. Stat Med 40:3724–3739
Wang H, Li R, Tsai CL (2007) Tuning parameter selectors for the smoothly clipped absolute deviation method. Biometrika 94:553–568
Wang H, Li B, Leng C (2009) Shrinkage tuning parameter selection with a diverging number of parameters. J R Stat Soc B 71:671–683
Wang P, Zhao H, Sun J (2016) Regression analysis of case K interval-censored failure time data in the presence of informative censoring. Biometrics 72:1103–1112
Wang S, Wang C, Wang P, Sun J (2018) Semiparametric analysis of the additive hazards model with informatively interval-censored failure time data. Comput Stat Data Anal 125:1–9
Wang L, Peng B, Bradic J, Li R, Wu Y (2020a) A tuning-free robust and efficient approach to high-dimensional regression. J Am Stat Assoc 115:1700–1714
Wang S, Wang C, Wang P, Sun J (2020b) Estimation of the additive hazards model with case K interval-censored failure time data in the presence of informative censoring. Comput Stat Data Anal 144:1–9
Wang S, Xu D, Wang C, Sun J (2023) Estimation of linear transformation cure models with informatively interval-censored failure time data. J Nonparametric Stat 35:283–301
Wu Y, Cook R (2015) Penalized regression for interval-censored times of disease progression: selection of HLA markers in psoriatic arthritis. Biometrics 71:782–791
Yang S, Prentice RL (1999) Semiparametric inference in the proportional odds regression model. J Am Stat Assoc 94:125–136
Zhang H, Lu WB (2007) Adaptive Lasso for Coxs proportional hazards model. Biometrika 94:1–13
Zhao H, Wu Q, Li G, Sun J (2020) Simultaneous estimation and variable selection for interval-censored data with broken adaptive ridge regression. J Am Stat Assoc 115:204–216
Zhao B, Wang S, Wang C, Sun J (2021) New methods for the additive hazards model with the informatively interval-censored failure time data. Biom J 63:1507–1525
Zou H (2006) The adaptive Lasso and its oracle properties. J Am Stat Assoc 101:1418–1429