Generalized Additive Models for Location, Scale and Shape

Robert A. Rigby1, D. M. Stasinopoulos1
1London Metropolitan University, UK

Tóm tắt

Summary

A general class of statistical models for a univariate response variable is presented which we call the generalized additive model for location, scale and shape (GAMLSS). The model assumes independent observations of the response variable y given the parameters, the explanatory variables and the values of the random effects. The distribution for the response variable in the GAMLSS can be selected from a very general family of distributions including highly skew or kurtotic continuous and discrete distributions. The systematic part of the model is expanded to allow modelling not only of the mean (or location) but also of the other parameters of the distribution of y, as parametric and/or additive nonparametric (smooth) functions of explanatory variables and/or random-effects terms. Maximum (penalized) likelihood estimation is used to fit the (non)parametric models. A Newton–Raphson or Fisher scoring algorithm is used to maximize the (penalized) likelihood. The additive terms in the model are fitted by using a backfitting algorithm. Censored data are easily incorporated into the framework. Five data sets from different fields of application are analysed to emphasize the generality of the GAMLSS class of models.

Từ khóa


Tài liệu tham khảo

Aitkin, 1999, A general maximum likelihood analysis of variance components in generalized linear models, Biometrics, 55, 117, 10.1111/j.0006-341X.1999.00117.x

Akaike, 1974, A new look at the statistical model identification, IEEE. Trans. Autom. Control, 19, 716, 10.1109/TAC.1974.1100705

Akaike, 1983, Information measures and model selection, Bull. Int. Statist. Inst., 50, 277

Benjamin, 2003, Generalized Autoregressive Moving Average Models, J. Am. Statist. Ass., 98, 214, 10.1198/016214503388619238

Berger, 1985, Statistical Decision Theory and Bayesian Analysis, 10.1007/978-1-4757-4286-2

Besag, 1999, Bayesian analysis of agriculture field experiments (with discussion), J. R. Statist. Soc, 61, 691, 10.1111/1467-9868.00201

Besag, 1991, Bayesian image restoration, with applications in spatial statistics (with discussion), Ann. Inst. Statist. Math., 43, 1, 10.1007/BF00116466

de Boor, 1978, A Practical Guide to Splines, 10.1007/978-1-4612-6333-3

Box, 1964, An analysis of transformations (with discussion), J. R. Statist. Soc, 26, 211

Box, 1973, Bayesian Inference in Statistical Analysis

Breslow, 1993, Approximate inference in generalized linear mixed models, J. Am. Statist. Ass., 88, 9

Breslow, 1995, Bias correction in generalized linear mixed models with a single component of dispersion, Biometrika, 82, 81, 10.1093/biomet/82.1.81

Claeskens, 2003, The focused information criterion, J. Am. Statist. Ass., 98, 900, 10.1198/016214503000000819

Cleveland, 1993, Statistical Modelling in S, 309

Cole, 1998, British 1990 growth reference centiles for weight, height, body mass index and head circumference fitted by maximum penalized likelihood, Statist. Med., 17, 407, 10.1002/(SICI)1097-0258(19980228)17:4<407::AID-SIM742>3.0.CO;2-L

Cole, 1992, Smoothing reference centile curves: the LMS method and penalized likelihood, Statist. Med., 11, 1305, 10.1002/sim.4780111005

Cole, 1999, Centiles of body mass index for Dutch children age 0-20 years in 1980—a baseline to assess recent trends in obesity, Ann. Hum. Biol., 26, 303, 10.1080/030144699282999

Cox, 1987, Parameter orthogonality and approximate conditional inference (with discussion), J. R. Statist. Soc. B, 49, 1

Crisp, 1994, A note on nonregular likelihood functions in heteroscedastic regression models, Biometrika, 81, 585, 10.1093/biomet/81.3.585

CYTEL Software Corporation, 2001, EGRET for Windows

Diggle, 2002, Analysis of Longitudinal Data, 2nd, 10.1093/oso/9780198524847.001.0001

Draper, 1995, Assessment and propagation of model uncertainty (with discussion), J. R. Statist. Soc, 57, 45

Dunn, 1996, Randomised quantile residuals, J. Comput. Graph. Statist., 5, 236

Eilers, 1996, Flexible smoothing with B-splines and penalties (with comments and rejoinder), Statist. Sci., 11, 89, 10.1214/ss/1038425655

Fahrmeir, 2001, Bayesian inference for generalized additive mixed models based on Markov random field priors, Appl. Statist., 50, 201

Fahrmeir, 2001, Multivariate Statistical Modelling based on Generalized Linear Models, 2nd, 10.1007/978-1-4757-3454-6

Gange, 1996, Use of the beta–binomial distribution to model the effect of policy changes on appropriateness of hospital stays, Appl. Statist., 45, 371, 10.2307/2986094

Green, 1985, Linear models for field trials, smoothing and cross-validation, Biometrika, 72, 527, 10.1093/biomet/72.3.527

Green, 1994, Nonparametric Regression and Generalized Linear Models, 10.1007/978-1-4899-4473-3

Harvey, 1989, Forecasting Structural Time Series Models and the Kalman Filter

Hastie, 1990, Generalized Additive Models

Hastie, 1993, Varying-coefficient models (with discussion), J. R. Statist. Soc, 55, 757

Hastie, 2000, Bayesian backfitting, Statist. Sci., 15, 213

Hastie, 2001, The Elements of Statistical Learning: Data Mining, Inference and Prediction, 10.1007/978-0-387-21606-5

Hjort, 2003, Frequentist model average estimation, J. Am. Statist. Ass., 98, 879, 10.1198/016214503000000828

Hodges, 1998, Some algebra and geometry for hierarchical models, applied to diagnostics (with discussion), J. R. Statist. Soc, 60, 497, 10.1111/1467-9868.00137

Hodges, 2001, Counting degrees of freedom in hierarchical and other richly-parameterised models, Biometrika, 88, 367, 10.1093/biomet/88.2.367

Ihaka, 1996, R: a language for data analysis and graphics, J. Computnl Graph. Statist., 5, 299

Johnson, 1949, Systems of frequency curves generated by methods of translation, Biometrika, 36, 149, 10.1093/biomet/36.1-2.149

Johnson, 1994, Continuous Univariate Distributions, 2nd

Johnson, 1995, Continuous Univariate Distributions, 2nd

Johnson, 1993, Univariate Discrete Distributions, 2nd

Kohn, 1998, Bayesian Analysis of Time Series and Dynamic Models, 393

Kohn, 1991, The performance of cross-validation and maximum likelihood estimators of spline smoothing parameters, J. Am. Statist. Ass., 86, 1042, 10.1080/01621459.1991.10475150

Lange, 1999, Numerical Analysis for Statisticians

Lange, 1989, Robust statistical modelling using the t distribution, J. Am. Statist. Ass., 84, 881

Lee, 1996, Hierarchical generalized linear models (with discussion), J. R. Statist. Soc, 58, 619

Lee, 2000, Two ways of modelling overdispersion in non-normal data, Appl. Statist., 49, 591

Lee, 2001, Hierarchical generalised linear models: a synthesis of generalised linear models, random-effect models and structured dispersions, Biometrika, 88, 987, 10.1093/biomet/88.4.987

Lee, 2001, Modelling and analysing correlated non-normal data, Statist. Modllng, 1, 3, 10.1177/1471082X0100100102

Lin, 1999, Inference in generalized additive mixed models by using smoothing splines, J. R. Statist. Soc, 61, 381, 10.1111/1467-9868.00183

Lopatatzidis, 2000, Nonparametric quantile regression using the gamma distribution

Madigan, 1994, Model selection and accounting for model uncertainty in graphical models using Occam’s window, J. Am. Statist. Ass., 89, 1535, 10.1080/01621459.1994.10476894

McCulloch, 1997, Maximum likelihood algorithms for generalized linear mixed models, J. Am. Statist. Ass., 92, 162, 10.1080/01621459.1997.10473613

Nelder, 1972, Generalized linear models, J. R. Statist. Soc, 135, 370

Nelson, 1991, Conditional heteroskedasticity in asset returns: a new approach, Econometrica, 59, 347, 10.2307/2938260

Ortega, 1970, Iterative Solution of Nonlinear Equations in Several Variables

Pawitan, 2001, In All Likelihood: Statistical Modelling and Inference using Likelihood, 10.1093/oso/9780198507659.001.0001

Raftery, 1996, Approximate Bayes factors and accounting for model uncertainty in generalised linear models, Biometrika, 83, 251, 10.1093/biomet/83.2.251

Raftery, 1999, Bayes Factors and BIC: comment on ‘A critique of the Bayesian Information Criterion for model selection’, Sociol. Meth. Res., 27, 411, 10.1177/0049124199027003005

Reinsch, 1967, Smoothing by spline functions, Numer. Math., 10, 177, 10.1007/BF02162161

Rigby, 1996, A semi-parametric additive model for variance heterogeneity, Statist. Comput., 6, 57, 10.1007/BF00161574

Rigby, 1996, Statistical Theory and Computational Aspects of Smoothing, 215, 10.1007/978-3-642-48425-4_16

Rigby, 2004, Technical Report 01/04

Rigby, 2004, Smooth centile curves for skew and kurtotic data modelled using the Box-Cox Power Exponential distribution, Statist. Med., 23, 3053, 10.1002/sim.1861

Ripley, 1996, Pattern Recognition and Neural Networks, 10.1017/CBO9780511812651

Royston, 1994, Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling (with discussion), Appl. Statist., 43, 429, 10.2307/2986270

Schumaker, 1993, Spline Functions: Basic Theory

Schwarz, 1978, Estimating the dimension of a model, Ann. Statist., 6, 461, 10.1214/aos/1176344136

Silverman, 1985, Some aspects of the spline smoothing approach to non-parametric regression curve fitting (with discussion), J. R. Statist. Soc, 47, 1

Smith, 1979, Splines as a useful and convenient statistical tool, Am. Statistn, 33, 57

Speed, 1991, Comment on ‘That BLUP is a good thing: the estimation of random effects’ (by G. K. Robinson), Statist. Sci., 6, 42, 10.1214/ss/1177011930

Stasinopoulos, 1992, Detecting break points in generalised linear models, Comput. Statist. Data Anal., 13, 461, 10.1016/0167-9473(92)90119-Z

Stasinopoulos, 2004, Technical Report 02/04, 10.2172/821786

Stasinopoulos, 2000, Modelling rental guide data using mean and dispersion additive models, Statistician, 49, 479, 10.1111/1467-9884.00247

Thall, 1990, Some covariance models for longitudinal count data with overdispersion, Biometrics, 46, 657, 10.2307/2532086

Tierney, 1986, Accurate approximations for posterior moments and marginal densities, J. Am. Statist. Ass., 81, 82, 10.1080/01621459.1986.10478240

Tong, 1990, Non-linear Time Series, 10.1093/oso/9780198522249.001.0001

Verbyla, 1999, The analysis of designed experiments and longitudinal data by using smoothing splines (with discussion), Appl. Statist., 48, 269

Wahba, 1978, Improper priors, spline smoothing and the problem of guarding against model errors in regression, J. R. Statist. Soc, 40, 364

Wahba, 1985, A comparison of GCV and GML for choosing the smoothing parameter in the generalized spline smoothing problem, Ann. Statist., 4, 1378

Wood, 2000, Modelling and smoothing parameter estimation with multiple quadratic penalties, J. R. Statist. Soc, 62, 413, 10.1111/1467-9868.00240

Wood, 2001, mgcv: GAMs and Generalised Ridge Regression for R, R News, 1, 20

Zeger, 1991, Generalized linear models with random effects: a Gibbs sampling approach, J. Am. Statist. Ass., 86, 79, 10.1080/01621459.1991.10475006

Amoroso, 1925, Ricerche intorno alla curve dei redditi, Ann. Mat. Pura Appl., 2, 123, 10.1007/BF02409935

Azzalini, 1985, A class of distributions which includes the normal ones, Scand. J. Statist., 123, 171

Breslow, Bias correction in generalized linear mixed models with a single component of dispersion, Biometrika, 82, 81, 10.1093/biomet/82.1.81

Cole, Smoothing reference centile curves: the LMS method and penalized likelihood, Statist. Med., 11, 1305, 10.1002/sim.4780111005

Fernandez, 1998, On Bayesian modeling of fat tails and skewness, J. Am. Statist. Ass., 93, 359

Johnson, Continuous Univariate Distributions, 2nd edn.

Jones, 2003, A skew extension of the t-distribution, with applications, J. R. Statist. Soc. B, 65, 159, 10.1111/1467-9868.00378

Lane, 1996, Compstat Proceedings in Computational Statistics, 331

Lee, 2004, Estimating intraclass correlation for binary data using extended quasi-likelihood, Statist. Modllng, 4, 113, 10.1191/1471082X04st070oa

Lee, 1996, Hierarchical generalized linear models (with discussion), J. R. Statist. Soc. B, 58, 619

Lee, 2001, Hierarchical generalised linear models: a synthesis of generalised linear models, random effect models and structured dispersions, Biometrika, 88, 987, 10.1093/biomet/88.4.987

Lee, 2004, Double hierarchical generalized linear models

Little, 2002, Statistical Analysis with Missing Data, 2nd, 10.1002/9781119013563

Longford, 2003, An alternative to model selection in ordinary regression, Statist. Comput., 13, 67, 10.1023/A:1021995912647

Longford, 2005, Modern Analytical Equipment for the Survey Statistician: Missing Data and Small-area Estimation

McCullagh, 1989, Generalized Linear Models, 2nd, 10.1007/978-1-4899-3242-6

Mooney, 2003, Fitting mixtures of von Mises distributions: a case study involving sudden infant death syndrome, Computnl Statist. Data Anal., 41, 505, 10.1016/S0167-9473(02)00181-0

Nandi, 1995, An extension of the generalized Gaussian distribution to include asymmetry, J. Franklin Inst., 332, 67, 10.1016/0016-0032(95)00029-W

Noh, 2004, REML estimation for binary data in GLMMs

Pan, 2004, A comparison of goodness of fit tests for age-related reference ranges, Statist. Med., 23, 1749, 10.1002/sim.1692

R Development Core Team, 2004, R: a Language and Environment for Statistical Computing

Rider, 1958, Generalized Cauchy distributions, Ann. Inst. Statist. Math., 9, 215, 10.1007/BF02892507

Rieck, 1991, A log-linear model for the Birnbaum–Saunders distribution, Technometrics, 33, 51

Rigby, 2004, Box-Cox t distribution for modelling skew and leptokurtotic data

Rigby, Smooth centile curves for skew and kurtotic data modelled using the Box-Cox Power Exponential distribution, Statist. Med., 23, 3053, 10.1002/sim.1861

Stacy, 1962, A generalization of the gamma distribution, Ann. Math. Statist., 33, 1187, 10.1214/aoms/1177704481

Wu, 2003, Optimal design for beta distributed responses