CLUSKEXT: CLUstering model for SKew-symmetric data including EXTernal information

Advances in Data Analysis and Classification - Tập 12 - Trang 43-64 - 2015
Donatella Vicari1
1Dipartimento di Scienze Statistiche, Sapienza University of Rome, Roma, Italy

Tóm tắt

A CLUstering model for SKew-symmetric data including EXTernal information (CLUSKEXT) is proposed, which relies on the decomposition of a skew-symmetric matrix into within and between cluster effects which are further decomposed into regression and residual effects when possible external information on the objects is available. In order to fit the imbalances between objects, the model jointly searches for a partition of objects and appropriate weights which are in turn linearly linked to the external variables. The proposal is fitted in a least-squares framework and a decomposition of the fit is derived. An appropriate Alternating Least-Squares algorithm is provided to fit the model to illustrative real and artificial data.

Tài liệu tham khảo

Bell DR, Lattin JM (1998) Shopping behavior and consumer preference for store price format: Why ‘large basket’ shoppers prefer EDLP. Mark Sci 17:66–88 Borg I, Groenen PJF (1995) Asymmetries in multidimensional scaling. In: Faulbaum F (ed) Softstat ’95. Gustav Fischer, Stuttgart, pp 31–35 Borg I, Groenen P (2005) Modern multidimensional scaling. Theory and applications, 2nd edn. Springer, Berlin Bove, G. (2006) Approaches to asymmetric multidimensional scaling with external information. In: Zani S, Cerioli A, Riani M, Vichi M (eds) Data analysis, classification and forward search. Series: Data analysis, studies in classification data analysis and knowledge organization. Springer, Heidelberg, pp 69–76 Constantine AG, Gower JC (1978) Graphic representations of asymmetric matrices. Appl Stat 27:297–304 Ekman GA (1963) A direct method for multidimensional ratio scaling. Psychometrika 28:3–41 Escoufier Y, Grorud A (1980) Analyse factorielle des matrices carrées non-symétriques. In: Diday E et al (eds) Data analysis and informatics. North Holland, Amsterdam, pp 263–276 Gower JC (1977) The analysis of asymmetry and orthogonality. In: Barra JR, Brodeau F, Romier G, Van Cutsem B (eds) Recent Developments in Statistics. North Holland, Amsterdam, pp 109–123 Kiers HAL, Vicari D, Vichi D (2005) Simultaneous classification and multidimensional scaling with external information. Psychometrika 70:433–470 Liu S, Trenkler G (2008) Hadamard, Khatri-Rao, Kronecker and other matrix products. Int J Inf Syst Sci 4:160–177 Martín-Merino M, Munoz A (2005) Visualizing asymmetric proximities with SOM and MDS models. Neurocomputing 63:171–192 Munoz A, Martín-Merino M (2002) New asymmetric iterative scaling models for the generation of textual word maps, JADT 2002: 6es Journées internationales d’Analyse statistique des Données Textuelles, Saint Malo, France, pp 593–603 Okada A (1990) A generalization of asymmetric multidimensional scaling. In: Schader M, Gaul W (eds) Knowledge, data and computer-assisted decisions. Springer, Berlin, pp 127–138 Okada A, Imaizumi T (1987) Geometric models for asymmetric similarity data. Behaviormetrika 21:81–96 Olszewski D (2012) K-means clustering of asymmetric data. In: Corchado, E. et al. (eds.), Hybrid Artificial Intelligent Systems 2012, Part I, Lecture Notes in Computer Science, vol 7208. Springer, Berlin Heidelberg, pp 243–254 Olszewski D, Ster B (2014) Asymmetric clustering using the alpha-beta divergence. Pattern Recognit 47(5):2031–2041 Rocci R, Bove G (2002) Rotation techniques in asymmetric multidimensional scaling. J Comput Graph Stat 11:405–419 Saito T, Yadohisa H (2005) Data analysis of asymmetric structures. Advanced Approaches in Computational Statistics, Marcel Dekker, New York Vicari D (2014) Classification of asymmetric proximity data. J Classif 31(3):386–420 Zielman B, Heiser WJ (1996) Models for asymmetric proximities. Br J Math Stat Psychol 49:127–146