A Smoothing-Based Goodness-of-Fit Test of Covariance for Functional Data

Biometrics - Tập 75 Số 2 - Trang 562-571 - 2019
Stephanie T. Chen1,1,2, Luo Xiao1, Ana‐Maria Staicu1
1Department of Statistics, North Carolina State University, Raleigh, North Carolina
2Stephanie T. Chen,

Tóm tắt

Abstract Functional data methods are often applied to longitudinal data as they provide a more flexible way to capture dependence across repeated observations. However, there is no formal testing procedure to determine if functional methods are actually necessary. We propose a goodness-of-fit test for comparing parametric covariance functions against general nonparametric alternatives for both irregularly observed longitudinal data and densely observed functional data. We consider a smoothing-based test statistic and approximate its null distribution using a bootstrap procedure. We focus on testing a quadratic polynomial covariance induced by a linear mixed effects model and the method can be used to test any smooth parametric covariance function. Performance and versatility of the proposed test is illustrated through a simulation study and three data applications.

Từ khóa


Tài liệu tham khảo

Bai, 2009, Corrections to LRT on large-dimensional covariance matrix by RMT, Ann Stat, 1, 135

Besse, 1986, Principal components analysis of sampled functions, Psychometrika, 51, 285, 10.1007/BF02293986

Bücher, 2011, Testing model assumptions in functional regression models, J Multivariate Anal, 102, 1472, 10.1016/j.jmva.2011.05.014

Cai, 2011, Optimal estimation of the mean function based on discretely sampled functional data: Phase transition, Ann Stat, 39, 2330, 10.1214/11-AOS898

Chiou, 2007, Diagnostics for functional regression via residual processes, Comput Stat Data Anal, 51, 4849, 10.1016/j.csda.2006.07.042

Crainiceanu, 2004, Likelihood ratio tests in linear mixed models with one variance component, J R Stat Soc, 66, 165, 10.1111/j.1467-9868.2004.00438.x

Diggle, 2002, Analysis of Longitudinal Data, 10.1093/oso/9780198524847.001.0001

Fan, 2000, Two-step estimation of functional linear models with application to longitudinal data, J R Stat Soc Series B, 62, 303, 10.1111/1467-9868.00233

Goldsmith, 2011, Penalized functional regression analysis of white-matter tract profiles in multiple sclerosis, Neuroimage, 57, 431, 10.1016/j.neuroimage.2011.04.044

Goldsmith, 2013, Corrected confidence bands for functional data using principal components, Biometrics, 69, 41, 10.1111/j.1541-0420.2012.01808.x

Goldsmith, 2016, refund: Regression with Functional Data

González-Manteiga, 2013, An updated review of goodness-of-fit tests for regression models, Test, 22, 361, 10.1007/s11749-013-0327-5

Greven, 2008, Restricted likelihood ratio testing for zero variance components in linear mixed models, J Comput Graph Stat, 17, 870, 10.1198/106186008X386599

Hardle, 1993, Comparing nonparametric versus parametric regression fits, Ann Stat, 21, 1926, 10.1214/aos/1176349403

James, 2000, Principal component models for sparse functional data, Biometrika, 87, 587, 10.1093/biomet/87.3.587

Kaslow, 1987, The multicenter AIDS cohort study: Rationale, organization, and selected characteristics of the participants, Am J Epidemiol, 126, 310, 10.1093/aje/126.2.310

Kong, 2016, Classical testing in functional linear models, J Nonparametr Stat, 28, 813, 10.1080/10485252.2016.1231806

Laird, 1982, Random-effects models for longitudinal data, Biometrics, 38, 963, 10.2307/2529876

Ledoit, 2002, Some hypothesis tests for the covariance matrix when the dimension is large compared to the sample size, Ann Stat, 30, 1081, 10.1214/aos/1031689018

Lindstrom, 1988, Newton-Raphson and EM algorithms for linear mixed-effects models for repeated-measures data, J Am Stat Assoc, 83, 1014

McLean, 2015, Restricted likelihood ratio tests for linearity in scalar-on-function regression, Stat Comput, 25, 997, 10.1007/s11222-014-9473-1

Peng, 2009, A geometric approach to maximum likelihood estimation of functional principal components from sparse longitudinal data, J Comput Graph Stat, 18, 995, 10.1198/jcgs.2009.08011

Pinheiro, 2017, nlme: Linear and Nonlinear Mixed Effects Models

Ramsay, 2002, Applied Functional Data Analysis

Ramsay, 2005, Functional Data Analysis, 10.1007/b98888

Reich, 2010, Automated vs, Neuroimage, 49, 3047, 10.1016/j.neuroimage.2009.11.043

Ruppert, 2003, Semiparametric Regression, 10.1017/CBO9780511755453

Schiepl, 2016, RLRsim: Exact (Restricted) Likelihood Ratio Tests for Mixed and Additive Models

Self, 1987, Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions, J Am Stat Assoc, 82, 605, 10.1080/01621459.1987.10478472

Swihart, 2014, Restricted likelihood ratio tests for functional effects in the functional linear model, Technometrics, 56, 483, 10.1080/00401706.2013.863163

Taylor, 1994, A stochastic model for analysis of longitudinal AIDS data, J Am Stat Assoc, 89, 727, 10.1080/01621459.1994.10476806

Wood, 2003, Thin plate regression splines, J R Stat Soc Series B, 65, 95, 10.1111/1467-9868.00374

Wood, 2017, mgcv: Mixed GAM Computation Vehicle with Automatic Smoothness Estimation

Wu, 1986, Jackknife, bootstrap and other resampling methods in regression analysis, Ann Stat, 14, 1261

Xiao, 2018, Fast covariance estimation for sparse functional data, Stat Comput, 28, 511, 10.1007/s11222-017-9744-8

Yao, 2005, Functional data analysis for sparse longitudinal data, J Am Stat Assoc, 100, 577, 10.1198/016214504000001745

Zeger, 1994, Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters, Biometrics, 50, 689, 10.2307/2532783

Zhong, 2017, Tests for covariance structures with high-dimensional repeated measurements, Ann Stat, 45, 1185, 10.1214/16-AOS1481