Generalized Linear Array Models with Applications to Multidimensional Smoothing

Iain D. Currie1, Maŕıa Durbán2, Paul H.C. Eilers3
1Heriot-Watt University, Edinburgh, UK
2Universidad Carlos III de Madrid, Spain
3Leiden University Medical Centre, The Netherlands

Tóm tắt

SummaryData with an array structure are common in statistics, and the design or regression matrix for analysis of such data can often be written as a Kronecker product. Factorial designs, contingency tables and smoothing of data on multidimensional grids are three such general classes of data and models. In such a setting, we develop an arithmetic of arrays which allows us to define the expectation of the data array as a sequence of nested matrix operations on a coefficient array. We show how this arithmetic leads to low storage, high speed computation in the scoring algorithm of the generalized linear model. We refer to a generalized linear array model and apply the methodology to the smoothing of multidimensional arrays. We illustrate our procedure with the analysis of three data sets: mortality data indexed by age at death and year of death, spatially varying microarray background data and disease incidence data indexed by age at death, year of death and month of death.

Từ khóa


Tài liệu tham khảo

Akaike, 1973, Maximum likelihood identification of Gaussian autoregressive moving average models, Biometrika, 60, 255, 10.1093/biomet/60.2.255

de Boor, 1979, Efficient computer manipulation of tensor products, ACM Trans. Math. Softwr., 5, 173, 10.1145/355826.355831

Breslow, 1993, Approximate inference in generalized linear mixed models, J. Am. Statist. Ass., 88, 9

Brewer, 1978, Kronecker products and matrix calculus in system theory, IEEE Trans. Circ. Syst., 25, 772, 10.1109/TCS.1978.1084534

Clayton, 1987, Models for temporal variation in cancer rates: II, Age-period-cohort models, Statist. Med., 6, 469, 10.1002/sim.4780060406

Craven, 1979, Smoothing noisy data with spline functions, Numer. Math., 31, 377, 10.1007/BF01404567

Currie, 2004, Smoothing and forecasting mortality rates, Statist. Modllng, 4, 279, 10.1191/1471082X04st080oa

Dierckx, 1993, Curve and Surface Fitting with Splines, 10.1093/oso/9780198534419.001.0001

Durban, 2005, Multidimensional P-spline mixed models: an efficient method for estimation of multivariate densities

Eilers, 1999, Discussion on ‘The analysis of designed experiments and longitudinal data by using smoothing splines’ (by A. P. Verbyla, B. R. Cullis, M. G. Kenward and S. J. Welham), Appl. Statist., 48, 307

Eilers, 2006, Fast and compact smoothing on large multidimensional grids, Comput. Statist. Data Anal., 50, 61, 10.1016/j.csda.2004.07.008

Eilers, 1996, Flexible smoothing with B-splines and penalties, Statist. Sci., 11, 89, 10.1214/ss/1038425655

Gower, 1982, The Yates algorithm, Util. Math., 21, 99

Green, 1985, Linear models for field trials, smoothing and cross-validation, Biometrika, 72, 527, 10.1093/biomet/72.3.527

Green, 1994, Nonparametric Regression and Generalized Linear Models, 10.1007/978-1-4899-4473-3

Horn, 1991, Topics in Matrix Analysis, 10.1017/CBO9780511840371

Oeppen, 2004

R Development Core Team, 2004, R: a Language and Environment for Statistical Computing

Richards, 2005, The importance of year of birth in two-dimensional mortality data

Ruppert, 2003, Semiparametric Regression, 10.1017/CBO9780511755453

Schwarz, 1978, Estimating the dimension of a model, Ann. Statist., 6, 461, 10.1214/aos/1176344136

Searle, 1982, Matrix Algebra Useful for Statistics

Searle, 1992, Variance Components, 10.1002/9780470316856

Silverman, 1985, Some aspects of the spline smoothing approach to non-parametric regression curve fitting (with discussion), J. R. Statist. Soc. B, 47, 1

Van Loan, 2000, The ubiquitous Kronecker product, J. Comput. Appl. Math., 123, 85, 10.1016/S0377-0427(00)00393-9

Verbyla, 1999, The analysis of designed experiments and longitudinal data by using smoothing splines (with discussion), Appl. Statist., 48, 269

Wahba, 1983, Bayesian ‘‘confidence intervals’’ for the cross-validated smoothing spline, J. R. Statist. Soc. B, 45, 133

Wand, 2003, Smoothing and mixed models, Comput. Statist., 18, 223, 10.1007/s001800300142

Wood, 2000, Modelling and smoothing parameter estimation with multiple quadratic penalties, J. R. Statist. Soc. B, 62, 413, 10.1111/1467-9868.00240

Wood, 2004, R Package Version 1.1-5

Yates, 1937, Technical Communication 35, 1