An ordered clustering algorithm based on K-means and the PROMETHEE method

Liuhao Chen1, Zeshui Xu2, Hai Wang3, Shousheng Liu4
1College of Communications Engineering, PLA University of Science and Technology, Nanjing, China
2Business School, Sichuan University, Chengdu, China
3School of Economics and Management, Southeast University, Nanjing, China
4College of Sciences, PLA University of Science and Technology, Nanjing, China

Tóm tắt

The multi-criteria decision aid (MCDA) has been a fast growing area of operational research and management science during the past two decades. The clustering problem is one of the well-known MCDA problems, in which the K-means clustering algorithm is one of the most popular clustering algorithms. However, the existing versions of the K-means clustering algorithm are only used for partitioning the data into several clusters which don’t have priority relations. In this paper, we propose a complete ordered clustering algorithm called the ordered K-means clustering algorithm, which considers the preference degree between any two alternatives. Different from the K-means clustering algorithm, we apply the relative net flow of PROMETHEE to measure the closeness of alternatives. In this case, the ordered K-means clustering algorithm can capture the different importance degrees of criteria. At last, we employ the proposed algorithm to solve a practical ordered clustering problem concerning the human development indexes. Then a comparison analysis with an existing approach is conducted to demonstrate the advantages of the ordered K-means clustering algorithm.

Tài liệu tham khảo

Xu R, Wunsch D (2005) Survey of clustering algorithms. Neural Netw IEEE Trans 16:645–678 Filippone M, Camastra F, Masulli F, Rovetta S (2008) A survey of kernel and spectral methods for clustering. Pattern Recogn 41:176–190 Berkhin P (2006) A survey of clustering data mining techniques. Grouping multidimensional data. Springer, Berlin, pp 25–71 Baraldi A, Blonda P (1999) A survey of fuzzy clustering algorithms for pattern recognition. I. Syst Man Cybern Part B Cybern IEEE Trans 29:778–785 Xu ZS (2013) Intuitionistic fuzzy aggregation and clustering. Springer, Berlin Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall Inc, USA Melnykov I, Melnykov V (2014) On K-means algorithm with the use of Mahalanobis distances. Stat Probab Lett 84:88–95 Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum, New York Hai-Yan T, Bao D, Qi WG (2009) Research on traffic mode of elevator applied fuzzy C-mean clustering algorithm based on PSO. In: Measuring technology and mechatronics automation, 2009. ICMTMA’09. International Conference on, IEEE, pp 582–585 Havens TC, Bezdek JC, Leckie C, Hall LO, Palaniswami M (2012) Fuzzy c-means algorithms for very large data. IEEE Trans Fuzzy Syst 20:1130–1146 Xu ZS, Wu JJ (2010) Intuitionistic fuzzy c-means clustering algorithms. J Syst Eng Electron 21:580–590 Yu DJ, Shi SS (2015) Researching the development of Atanassov intuitionistic fuzzy set: using a citation network analysis. Appl Soft Comput 32:189–198 Chen N, Xu ZS, Xia MM (2014) Hierarchical hesitant fuzzy K-means clustering algorithm. Appl Math J Chin Univ 29(1):1–17 Deng ZY, Zhu XS, Cheng DB, Zong M, Zhang SC (2016) Efficient kNN classification algorithm for big data. Neurocomputing 195:143–148 Mashayekhy L, Nejad MM, Grosu D, Zhang Q, Shi WS (2015) Energy-aware scheduling of mapReduce jobs for big data applications. IEEE Trans Parallel Distrib Syst 26:2720–2733 Bolon-Canedo V, Fernandez-Francos D, Peteiro-Barral D, Alonso-Betanzos A, Guijarro-Berdinas B, Sanchez-Marono N (2016) A unified pipeline for online feature selection and classification. Expert Syst Appl 55:532–545 Duan HC, Peng YB, Min GY, Xiang XK, Zhan WH, Zou H (2015) Distributed in-memory vocabulary tree for real-time retrieval of big data images. Ad Hoc Netw 35:137–148 Wang H, Wu JJ, Yuan S, Chen J (2016) On characterizing scale effect of Chinese mutual funds via text mining. Sig Process 124:266–278 Jesse S, Chi M, Belianinov A, Beekman C, Kalinin SV, Borisevich AY, Lupini AR (2016) Big data analytics for scanning transmission electron microscopy ptychography. Sci Rep 6:26348 Soheily-Khah S, Douzal-Chouakria A, Gaussier E (2016) Generalized k-means-based clustering for temporal data under weighted and kernel time warp. Pattern Recogn Lett 75:63–69 De Smet Y, Nemery P, Selvaraj R (2012) An exact algorithm for the multicriteria ordered clustering problem. Omega 40:861–869 Brans JP, Mareschal B (2005) PROMETHEE methods. Multiple criteria decision analysis: state of the art surveys. Springer, Berlin, pp 163–186 Liao HC, Xu ZS (2014) Multi-criteria decision making with intuitionistic fuzzy PROMETHEE. J Intell Fuzzy Syst 27:1703–1717 Chen LH, Xu ZS (2015) A new prioritized multi-criteria outranking method: the prioritied PROMETHEE. J Intell Fuzzy Syst 29:2099–2110 Yu XH, Xu ZS, Ma Y (2013) Prioritized multi-criteria decision making based on the Idea of PROMETHEE. Proced Comp Sci 17:449–456 Bezdek J, Hathaway R (2003) Convergence of alternating optimization. Nueral Parallel Sci Comput 11:351–368 Brans JP, Vincke P (1985) A preference ranking organization method. The PROMETHEE method for multiple criteria decision-making. Manag Sci 31:647–656 De Smet Y, Gilbart F (2001) A cluster definition method for country risk problems. Technical Report SMG, IS-MG 2001/13 Noorbakhsh F (1998) The human development index: some technical issues and alternative indices. J Int Dev 10:589–605 Noorbakhsh F (1996) Some reflections on the UNDP’s human development index. Centre for Development Studies, University of Glasgow, Scotland Trabold-Nübler H (1991) The human development index–A new development indicator? Intereconomics 26:236–243 Kelley AC (1991) The human development index: “handle with care”. Popul Dev Rev 17:315–324 Behzadian M, Kazemzadeh RB, Albadvi A, Aghdasi M (2010) PROMETHEE: a comprehensive literature review on methodologies and applications. Eur J Oper Res 200:198–215 Figueira JR, Greco S, Roy B, Słowiński R (2013) An overview of ELECTRE methods and their recent extensions. J Multi Criteria Decis Anal 20:61–85