Exploratory Visual Inspection of Category Associations and Correlation Estimation in Multidimensional Subspaces
Tóm tắt
In this paper, we aimed to estimate associations among categories in a multi-way contingency table. To simplify estimation and interpretation of results, we stacked multiple variables to form a two-way stacked table and analyzed it using the biplot in correspondence analysis (CA) paradigm. The correspondence analysis biplot allowed visual inspection of category associations in a twodimensional plane, and the CA solution numerically estimated the category relationships. We utilized parallel analysis and identified two statistically meaningful dimensions with which a plane was constructed. In the plane, we examined metric space mapping, which was converted into correlations, between school districts and categories of school-relevant variables. The results showed differential correlation patterns among school districts and this correlational information may be useful for stake holders or policy makers to pinpoint possible causes of low school performance and school-relevant behaviors.
Tài liệu tham khảo
BEH, E.J., and LOMBARDO, R. (2014), Correspondence Analysis: Theory, Practice and New Strategies, West Sussex, UK: John Wiley and Sons, Ltd.
BENZÉCRI, J.-P. (1973), L'Analyse des Données Volume 2: Analyse des Correspondances, Paris, France: Dunod.
BENZÉCRI, J.-P. (1982), Histoire et Préhistoire de l’Analyse des Données, Paris: Dunod.
BENZÉCRI, J.-P. (1992), Correspondence Analysis Handbook, New York: Dekker.
BLASIUS, J., and GREENACRE, M. (Eds.) (2014), Visualization and Verbalization of Data, Boca Rata FL: Chapman and Hall/CRC Press.
BRADU, D., and GABRIEL, K.R. (1978), “The Biplot as a Diagnostic Tool for Models of Two-Way Tables”, Technometrics, 20, 47–68.
GABRIEL, K.R. (1971), “The Biplot Graphic Display of Matrices with Application to Principal Component Analysis”, Biometrika, 58 (3), 453–467.
GABRIEL, K.R. (1981), “Biplot Display of Multivariate Matrices for Inspection of Data and Diagnosis”, in Interpreting Multivariate Data, ed. V. Barnett, New York: Wiley, pp. 147–173.
GABRIEL K.R. (2002), “Goodness of Fit Biplots and Correspondence Analysis”, Biometrika, 89, 423–436.
GABRIEL, K.R., and ODOROFF, C.I. (1990), “Biplots in Biomedical Research”, Statistics in Medicine, 9, 469–485.
GIFI, A. (1990), Nonlinear Multivariate Analysis, Chichester, UK: Wiley.
GOWER, J.C., and HAND, D.J. (1996), Biplots, London, UK: Chapman and Hall.
GREENACRE, M.J. (1984), Theory and Applications of Correspondence Analysis, London: Academic Press.
GREENACRE, M.J. (1993), “Biplots in Correspondence Analysis”, Journal of Applied Statistics, 20, 251–269.
GREENACRE, M.J. (2010), Biplots in Practice, Madrid: Fundación BBVA, pp 51–59.
GREENACRE, M.J. (2007), Correspondence Analysis in Practice (2nd ed.), Boca Raton, FL: Chapman and Hall/CRC.
GRENFELL, M., and LEBARON, F. (Eds.) (2014), Bourdieu and Data Analysis, Methodological Principles and Practice, Bern: Peter Lang.
HAYASHI, C. (1950), “On the Quantification of Qualitative Data from the Mathematico-Statistical Point of View”, Annals of the Institute of Statistical Mathematics, 2, 35–47.
HAYASHI, C. (1952), “On the Prediction of Phenomena from Qualitative Data and the Quantification of Qualitative Data from The Mathematico-Statistical Point of View”, Annals of the Institute of Statistical Mathematics, 3, 69–98.
LEBART, L., MORINEAU, A, and WARWICK, K.M. (1984), Multivariate Descriptive Statistical Analysis. NY: John Wiley and Sons, Inc.
LEGENDRE, P., and GALLAGHER, E.D. (2001), “Ecologically Meaningful Transformations for Ordination of Species Data”, Oecologia, 129, 271–280.
LE ROUX, B., and ROUANET, H. (2004), Geometric Data Analysis: From Correspondence Analysis to Structured Data. Dordrecht: Kluwer.
LÊ, S., JOSSE, J., and HUSSON, F. (2008), “FactoMineR: An R Package for Multivariate Analysis”, Journal of Statistical Software, 25, 1–18.
LORENZO-SEVA, U. (2011), “Horn’s Parallel Analysis for Selecting the Number of Dimensions in Correspondence Analysis”, Methodology European Journal of Research Methods for the Behavioral and Social Sciences, 7, 96–102.
KIM, S.-K., MCKAY, D., TAYLOR, S., TOLIN, D.F., OLATUNJI, B.O., TIMPANO, K.R., and ABRAMOWITZ, J.S. (2016), “The Structure of Obsessive Compulsive Symptoms and Beliefs: A Correspondence and Biplot Analysis”, Journal of Anxiety Disorders, 38, 79–87.
MURTAGH, F. (2005), Correspondence Analysis and Data Coding with Java and R, Boca Raton, FL: Chapman and Hall/CRC.
NEW YORK CITY SCHOOL DISTRICTS (2011), http://schools.nyc.gov/
NISHISATO, S. (1980), Analysis of Categorical Data: Dual Scaling and Its Applications, Toronto: University of Toronto Press.
NISHISATO, S. (1994), Elements of Dual Scaling: An Introduction to Practical Data Analysis, Hilsdale, NJ: Lawrence Erlbaum Associates.
NISHISATO, S. (2007), Multidimensional Nonlinear Descriptive Analysis, Boca Raton, FL: Chapman and Hall/CRC.
PLACKETT, R.L. (1983), “Karl Pearson and the Chi-Squared Test”, International Statistical Review, 51 (1), 59–72.
TER BRAAK, C.J.F. (1983), “Principal Components Biplots and Alpha and Beta Diversity”, Ecology, 64, 454–462.
TER BRAAK, C.J.F. (1990), “Interpreting Canonical Correlation Analysis Through Biplots of Structure Correlations and Weights”, Psychometrika, 55 (3), 519–531.
TER BRAAK, C.J.F., and VERDONSCHOT, P.E.M. (1995), “Canonical Correspondence Analysis and Related Multivariate Methods in Aquatic Ecology”, Aquatic Sciences, 57(3), 255–289.