How to Address Non-normality: A Taxonomy of Approaches, Reviewed, and Illustrated

Jolynn Pek1, Octavia Wong2, Augustine Wong2,3
1Psychology, The Ohio State University, United States
2Kinesiology and Health Sciences, York University, Canada
3Mathematics and Statistics, York University, Canada

Tóm tắt

Từ khóa


Tài liệu tham khảo

Abelson, 1995, Statistics as Principled Argument.

Aron, 2010, Statistics for the Behavioral and Social Sciences: A Brief Course, 5th Edn

Aron, 2012, Statistics for Psychology, 6th Edn

Baayen, 2017, The cave of shadows: addressing the human factor with generalized additive mixed models, J. Mem. Lang., 94, 206, 10.1016/j.jml.2016.11.006

Baguley, 2012, Serious Stats: A Guide to Advanced Statistics for the Behavioral Sciences, 10.1007/978-0-230-36355-7

Barnett, 1998, Outliers in Statistical Data, 3rd Edn

Bartlett, 1947, The use of transformations, Biometrics, 3, 39, 10.2307/3001536

Beaton, 1974, The fitting of power series, meaning polynomials, illustrated on band-spectroscopic data, Technometrics, 16, 147, 10.2307/1267936

Beins, 2012, Research Methods and Statistics, 1st Edn

Belhekar, 2016, Statistics for Psychology Using R, 10.4135/9789353282493

Bland, 1996, Transformations, means, and confidence intervals, BMJ, 312, 1079, 10.1136/bmj.312.7038.1079

Bono, 2017, Non-normal distributions commonly used in health, education, and social sciences: a systematic review, Front. Psychol., 8, 1602, 10.3389/fpsyg.2017.01602

Box, 1964, An analysis of transformations (with discussion), J. R. Stat. Soc. Ser. B, 26, 211, 10.1111/j.2517-6161.1964.tb00553.x

Breiman, 2001, Statistical modeling: the two cultures (with discussion), Stat. Sci., 16, 199, 10.1214/ss/1009213726

Breiman, 1985, Estimating optimal transformations for multiple regression and correlation, J. Am. Stat. Assoc., 80, 580, 10.2307/2288473

Brysbaert, 2011, Basic Statistics for Psychologists, 10.1007/978-0-230-34592-8

Cain, 2017, Univariate and multivariate skewness and kurtosis for measuring nonnormality: prevalence, influence and estimation, Behav. Res. Methods, 49, 1716, 10.3758/s13428-016-0814-1

Case, 2012, Exploring the World Through Social Statistics

Christopher, 2016, Interpreting and Using Statistics in Psychological Research

Cohen, 2004, Essentials of Statistics for the Social and Behavioral Sciences

Cohen, 2003, Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences, 3rd Edn

Coolican, 2014, Research Methods and Statistics in Psychology, 6th Edn

Cribari-Neto, 2004, Asymptotic inference under heteroskedasticity of unknown form. Computational, Stat. Data Anal., 45, 215, 10.1016/s0167-9473(02)00366-3

Cribari-Neto, 2011, A new heteroskedasticity-consistent covariance matrix estimator for the linear regression model, AStA Adv. Stat. Anal., 95, 129, 10.1007/s10182-010-0141-2

Cribari-Neto, 2007, Inference under heteroskedasticity and leveraged data, Commun. Stat. Theor. Methods, 36, 1877, 10.1080/03610920601126589

Darlington, 2017, Regression Analysis and Linear Models: Concepts, Applications and Implementation

Davidson, 1993, Estimation and Inference in Econometrics.

Davis, 2005, Introduction to Statistics and Research Methods: Becoming a Psychological Detective

De Veaux, 2015, Stats: Data and Models, 2nd Edn

Duan, 1983, Smearing estimate: a nonparametric retransformation method, J. Am. Stat. Assoc., 78, 605, 10.2307/2288126

Dudgeon, 2017, Some improvements in confidence intervals for standardized regression coefficients, Psychometrika, 82, 928, 10.1007/s11336-017-9563-z

Efron, 1979, Bootstrap methods: another look at the jackknife, Ann. Stat., 7, 1, 10.1214/aos/1176344552

Efron, 1981, Nonparametric standard errors and confidence intervals, Can. J. Stat., 9, 139, 10.2307/3314608

Efron, 1982, The Jackknife, the Bootstrap and Other Resampling Plans, 10.1137/1.9781611970319

Efron, 1987, Better bootstrap confidence intervals, J. Am. Stat. Assoc., 82, 171, 10.1080/01621459.1987.10478410

Efron, 1993, An Introduction to the Bootstrap, 10.1007/978-1-4899-4541-9

Eicker, 1963, Asymptotic normality and consistency of the least squares estimators for families of linear regressions, Ann. Math. Stat., 34, 447, 10.1214/aoms/1177704156

Eicker, 1967, “Limit theorems for regressions with unequal and dependent errors,”, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 59

Evans, 2014, Using Basic Statistics in the Behavioral and Social Sciences, 5th Edn, 10.4135/9781071878859

Field, 2013, Discovering Statistics Using IBM SPSS Statistics

Field, 2012, Discovering Statistics Using R

Fielding, 2006, Understanding Social Statistics, 10.4135/9781446249406

Foster, 2015, Beginning Statistics: An Introduction for Social Scientists, 2nd Edn

Fox, 2008, Applied Regression Analysis and Generalized Linear Models, 2nd Edn

Fox, 2015, Applied Regression Analysis and Generalized Linear Models, 3rd Edn

Fox, 2011, An R Companion to Applied Regression

Gallant, 1987, Nonlinear Statistical Models, 10.1002/9780470316719

Gelman, 2007, Data Analysis Using Regression and Multilevel/Hierarchical Models

Goodwin, 2016, Research in Psychology Methods and Design, 8th Edn

Gordon, 2015, Regression Analysis for the Social Sciences, 10.4324/9781315748788

Gould, 2015, Introductory Statistics: Exploring the World Through Data, 2nd Edn

Gravetter, 2017, Statistics for the Behavioral Sciences, 10 Edn

Ha, 2012, Integrative Statistics for the Social Sciences

Hanna, 2013, Psychology Statistics for Dummies

Harrell, 2015, Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis, 10.1007/978-3-319-19425-7

Haslam, 2014, Research Methods and Statistics in Psychology

Hayes, 2007, Using heteroskedasticity-consistent standard error estimators in OLS regression: an introduction and software implementation, Behav. Res. Methods, 39, 709, 10.3758/bf03192961

Healey, 2016, Statistics: A Tool for Social Research, 3rd Edn

Heiman, , Basic Statistics for the Behavioral Sciences, 7th Edn

Heiman, , Essential Statistics for the Behavioral Sciences

Hinkley, 1977, Jackknifing in unbalanced situations, Technometrics, 19, 285, 10.1080/00401706.1977.10489550

Howell, 2014, Statistics Methods for Psychology, 7th Edn

Howell, 2017, Fundamental Statistics for the Behavioral Sciences, 9th Edn

Howitt, 2014, Introduction to SPSS in Psychology, 6th Edn

Huber, 1964, Robust estimation of a location parameter, Ann. Math. Stat., 35, 73, 10.1214/aoms/1177703732

Huber, 1967, “The behavior of maximum likelihood estimates under nonstandard conditions,”, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 221

Hurlburt, 2012, Comprehending Behavioral Statistics

Jaccard, 2009, Statistics for the Behavioral Sciences, 5th Edn

Jackson, 2006, Fundamentals of Statistics for the Social Sciences

Jackson, 2015, Research Methods and Statistics: A Critical Thinking Approach, 5th Edn

Jackson, 2017, Statistics: Plain and Simple, 4th Edn

Judd, 2009, Data Analysis: A Model Comparison Approach, 2nd Edn

Kahane, 2008, Regression Basics, 2nd Edn, 10.4135/9781483385662

Keith, 2006, Multiple Regression and Beyond

King, 2014, How robust standard errors expose methodological problems they do not fix, and what to do about it, Polit. Anal., 23, 159, 10.1093/pan/mpu015

Kirk, 2012, Experimental Design: Procedures for the Behavioral Sciences, 10.1002/9781118133880.hop202001

Koenker, 1978, Regression quantiles, Econometrica, 46, 33, 10.2307/1913643

Krieg, 2012, Statistics and Data Analysis for Social Science

Lange, 1989, Robust statistical modeling using the t distribution, J. Am. Stat. Assoc., 84, 881, 10.2307/2290063

Long, 2000, Using heteroscedasticity consistent standard errors in the linear regression model, Am. Stat., 54, 217, 10.1080/00031305.2000.10474549

MacKinnon, 1985, Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties, J. Econometr., 29, 305, 10.1016/0304-4076(85)90158-7

Marmolejo-Ramos, 2015, Automatic detection of discordant outliers via the Uedas method, J. Stat. Distrib. Appl., 2, 8, 10.1186/s40488-015-0031-y

Mavridis, 2008, Detecting outliers in factor analysis using the forward search algorithm, Multivariate Behav. Res., 43, 453, 10.1080/00273170802285909

Maxwell, 2004, Desigining Experiments and Analyzing Data: A Model Comparison Perspective, 2nd Edn

Mayers, 2013, Introduction to Statistics and SPSS in Psychology

McCullagh, 1989, Generalized Linear Models, 2nd Edn, 10.1007/978-1-4899-3242-6

McGrath, 2011, Quantitative Models in Psychology, 10.1037/12316-000

McLachlan, 2004, Finite Mixture Models.

Micceri, 1989, The unicorn, the normal curve, and other improbable creatures, Psychol. Bull., 105, 156, 10.1037/0033-2909.105.1.156

Miles, 2007, Understanding and Using Statistics in Psychology

Mohanty, 2016, Statistics for Behavioral and Social Sciences

Mosteller, 1977, Data Analysis and Regression.

Nolan, 2017, Statistics for the Behavioral Sciences, 4th Edn

OsborneJ. W. Improving your data transformations: applying the Box-Cox transformation. Pract. Assess. Res. Eval. 152010

Pagano, 2012, Understanding Statistics in the Behavioral Sciences, 10th Edn

Pek, 2016, On the relationship between confidence regions and exchangeable weights in multiple linear regression, Multivariate Behav. Res., 51, 719, 10.1080/00273171.2016.1225563

Pek, , Confidence intervals for the mean of non-normal distribution: transform or not to transform, Open J. Stat., 7, 405, 10.4236/ojs.2017.73029

PekJ. WongO. WongA. C. M. Data transformations for inference with linear regression: clarifications and recommendations. Pract. Assess. Res. Eval. 22

Pelham, 2013, Intermediate Statistics: A Conceptual Course, 10.4135/9781071909836

Privitera, 2015, Statistics for the Behavioral Sciences, 2nd Edn

Privitera, 2017, Essential Statistics for the Behavioral Sciences

Rosenthal, 2007, Essentials of Behavioral Research: Methods and Data Analysis

Rousseeuw, 1984, Least median of squares regression, J. Am. Stat. Assoc., 79, 871, 10.2307/2288718

Rubio, 2016, Bayesian linear regression with skew-symmetric error distributions with applications to survival analysis, Stat. Med., 35, 2441, 10.1002/sim.6897

Salkind, 2014, Statistics for People Who (Think They) Hate Statistics, 5th Edn

Sampson, 1974, A tale of two regressions, J. Am. Stat. Assoc., 69, 682, 10.2307/2286002

Shadish, 2002, Experimental and Quasi-Experimental Designs for Generalized Causal Inference

Siegel, 1988, Nonparametric Statistics for the Behavioral Sciences, 2nd Edn.

Stasinopoulos, 2018, GAMLSS: a distributional regression approach, Stat. Model., 18, 248, 10.1177/1471082X18759144

Tabachnick, 2012, Using Multivariate Statistics, 6th Edn

Thode, 2002, Testing for Normality, 10.1201/9780203910894

Tibshirani, 1988, Estimating transformations for regression via additivity and variance stabilization, J. Am. Stat. Assoc., 83, 394, 10.1080/01621459.1988.10478610

Tokunga, 2016, Fundamental Statistics for the Social and Behavioral Sciences

Tukey, 1957, On the comparative anatomy of transformations, Ann. Math. Stat., 28, 602, 10.1214/aoms/1177706875

Tukey, 1969, Analyzing data: sanctification or detective work?, Am. Psychol., 24, 83, 10.1037/h0027108

Tukey, 1963, Less vulnerable confidence and significance procedures for location based on a single sample: Trimming/Winsorization 1, Sankhya Indian J. Stat. Ser. A, 25, 331

Urdan, 2016, Statistics in Plain English, 4th Edn

Vélez, 2015, A new approach to the Box–Cox transformation, Front. Appl. Math. Stat., 1, 12, 10.3389/fams.2015.00012

Waldmann, 2018, Quantile regression: a short story on how and why, Stat. Model., 18, 203, 10.1177/1471082x18759142

Warner, 2013, Applied Statistics, From Bivariate Through Multivariate Techniques, 2nd Edn

Weidman, 2017, The jingle and jangle of emotion assessment: Imprecise measurement, casual scale usage, and conceptual fuzziness in emotion research, Emotion, 17, 267, 10.1037/emo0000226

White, 1980, A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity, Econometrica, 48, 817, 10.2307/1912934

Wilcox, 2017, Introduction to Robust Estimation and Hypothesis Testing, 4th Edn

Wilson, 2016, Research Methods and Statistics: An Integrated Approach

Witte, 2015, Statistics, 11th Edn

Xia, 2018, Robust regression estimation based on low-dimensional recurrent neural networks, IEEE Trans. Neural Netw. Learn. Syst., 99, 1, 10.1109/tnnls.2018.2814824

Zhou, 1997, Confidence intervals for the log-normal mean, Stat. Med., 16, 783, 10.1002/(SICI)1097-0258(19970415)16:7h783::AID-SIM488i3.0.CO;2-2