From multi-view data features to clusters: a unified approach

Artificial Intelligence Review - Tập 56 - Trang 3821-3852 - 2023
Fadi Dornaika1
1Ho Chi Minh City Open University, Ho Chi Minh City, Vietnam

Tóm tắt

Clustering data from multiple sources or views has become an important issue in real-world applications. Graph-based methods take advantage of graphs that encode the local and global structure of the data. Although graph-based methods provide good clustering performance, they need to estimate the view graphs or the consensus graph from the raw data in a separate step. Their performance may be affected by the noisy graphs. To overcome this limitation and promote end-to-end multi-view clustering solutions, this paper presents two end-to-end multi-view clustering solutions starting from the data or their kernel representations. The first proposed solution is based on a single objective function that allows the joint estimation of the graph of each view, the consensus graph, the spectral projection matrices for all views, the soft clustering assignments, and the weights of each view. The second solution uses a different objective function where the links and constraints for the soft clustering assignment matrix use the consensus graph matrix and the consensus spectral projection matrix. Both methods enforce the mutual similarity of each graph and provide a direct clustering result without any additional step. The two proposed methods are tested on several real image and text datasets, which prove their superiority.

Tài liệu tham khảo

Cao X, Zhang C, Fu H, Liu S, Zhang H (2015) Diversity-induced multi-view subspace clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 586–594 Chen M-S, Huang L, Wang C-D, Huang D (2020) Multi-view clustering in latent embedding space. Proc AAAI Conf Artif Intell 34:3513–3520 Chen P, Liu L, Ma Z, Kang Z (2021) Smoothed multi-view subspace clustering. International conference on neural computing for advanced applications. Springer, pp 128–140 El Hajjar S, Dornaika F, Abdallah F (2021) Multi-view spectral clustering via constrained nonnegative embedding. Inf Fusion El Hajjar S, Dornaika F, Abdallah F (2022) One-step multi-view spectral clustering with cluster label correlation graph. Inf Sci El Hajjar S, Dornaika F, Abdallah F, Barrena, N (2022) Consensus graph and spectral representation for one-step multi-view kernel based clustering. Knowl-Based Syst: 108250 Greene D, Cunningham P (2009) A matrix factorization approach for integrating multiple data views. In: Joint European conference on machine learning and knowledge discovery in databases. Springer, pp. 423–438 Guo W, Shi Y, Wang S (2019) A unified scheme for distance metric learning and clustering via rank-reduced regression. IEEE Trans Syst Man Cybern: 1–12 Horie M, Kasai H (2021) Consistency-aware and inconsistency-aware graph-based multi-view clustering. In: 2020 28th European signal processing conference (EUSIPCO). IEEE, pp. 1472–1476 Hu Z, Nie F, Wang R, Li X (2020) Multi-view spectral clustering via integrating nonnegative embedding and spectral embedding. Inf Fusion 55:251–259 Hu Z, Nie F, Chang W, Hao S, Wang R, Li X (2020) Multi-view spectral clustering via sparse graph learning. Neurocomputing 384:1–10 Huang H-C, Chuang Y-Y, Chen C-S (2012) Affinity aggregation for spectral clustering. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp. 773–780 Huang S, Kang Z, Tsang IW, Xu Z (2019) Auto-weighted multi-view clustering via kernelized graph learning. Pattern Recogn 88:174–184 Huang S, Kang Z, Xu Z (2020) Auto-weighted multi-view clustering via deep matrix decomposition. Pattern Recogn 97:107015 Huang Z, Ren Y, Pu X, Pan L, Yao D, Yu G (2021) Dual self-paced multi-view clustering. Neural Netw 140:184–192 Huang D, Wang C-D, Peng H, Lai J, Kwoh C-K (2021) Enhanced ensemble clustering via fast propagation of cluster-wise similarities. IEEE Trans Syst Man Cybern 51(1):508–520 Hubert L, Arabie P (1985) Comparing partitions. J Classification 2(1):193–218 Kang Z, Lin Z, Zhu X, Xu W (2021) Structured graph learning for scalable subspace clustering: from single view to multiview. IEEE Trans Cybern 52:8976–8986 Kerenidis I, Landman J (2021) Quantum spectral clustering. Phys Rev A 103(4):042415 Kumar A, Daume H (2011) A co-training approach for multi-view spectral clustering. In: Proceedings of the 28th international conference on international conference on machine learning. ICML’11. Omnipress, Madison, WI, USA, pp. 393–400 Li Z, Nie F, Chang X, Nie L, Zhang H, Yang Y (2018) Rank-constrained spectral clustering with flexible embedding. IEEE Trans Neural Netw Learn Syst 29(12):6073–6082 Lin L, Tang C, Dong G, Chen Z, Pan Z, Liu J, Yang Y, Shi J, Ji R, Hong W (2021) Spectral clustering to analyze the hidden events in single-molecule break junctions. J Phys Chem C 125(6):3623–3630 Liu K, Li X, Zhu Z, Brand L, Wang H (2021) Factor-bounded nonnegative matrix factorization. ACM Trans Knowl Discov Data 15(6):1–18 Lu H, Liu S, Wei H, Chen C, Geng X (2021) Deep multi-kernel auto-encoder network for clustering brain functional connectivity data. Neural Netw 135:148–157 Ma J, Zhang Y, Zhang L (2021) Discriminative subspace matrix factorization for multiview data clustering. Pattern Recogn 111:107676 Nie F, Cai G, Li J, Li X (2017) Auto-weighted multi-view learning for image clustering and semi-supervised classification. IEEE Trans Image Process 27(3):1501–1511 Nie F, Cai G, Li X (2017) Multi-view clustering and semi-supervised classification with adaptive neighbours. In: Thirty-first AAAI conference on artificial intelligence (2017) Nie F, Li J, Li X, et al (2016) Parameter-free auto-weighted multiple graph learning: A framework for multiview clustering and semi-supervised classification. In: IJCAI, pp. 1881–1887 Nie F, Li J, Li X, et al (2017) Self-weighted multiview clustering with multiple graphs. In: Proceedings of the twenty-sixth international joint conference on artificial intelligence (IJCAI-17) Nie F, Tian L, Li X (2018) Multiview clustering via adaptively weighted procrustes. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp. 2022–2030 Nie F, Wang X, Jordan MI, Huang H (2016) The constrained Laplacian rank algorithm for graph-based clustering. In: AAAI, pp. 1969–1976 Pan E, Kang Z (2023) High-order multi-view clustering for generic data. Inf Fusion 100:101947 Ren Z, Sun Q (2021) Simultaneous global and local graph structure preserving for multiple kernel clustering. IEEE Trans Neural Netw Learn Syst 32(5):1839–1851. https://doi.org/10.1109/TNNLS.2020.2991366 Sellami L, Alaya B (2021) Samnet: self-adaptative multi-kernel clustering algorithm for urban vanets. Veh Commun 29:100332 Sharma KK, Seal A (2021) Multi-view spectral clustering for uncertain objects. Inf Sci 547:723–745 Sun G, Cong Y, Dong J, Liu Y, Ding Z, Yu H (2021) What and how: generalized lifelong spectral clustering via dual memory. IEEE Trans Pattern Anal Mach Intell Trigeorgis G, Bousmalis K, Zafeiriou S, Schuller BW (2016) A deep matrix factorization method for learning attribute representations. IEEE Trans Pattern Anal Mach Intell 39(3):417–429 Tzortzis G (2012) Likas A Kernel-based weighted multi-view clustering. In: 2012 IEEE 12th international conference on data mining, IEEE, pp. 675–684 Van der Maaten L, Hinton G (2008) Visualizing data using T-SNE. J Mach Learn Res 9(11) Von Luxburg U (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416 Wang D, Li T, Deng P, Liu J, Huang W, Zhang F (2023) A generalized deep learning algorithm based on NMF for multi-view clustering. IEEE Trans Big Data 9(1):328–340 Wang Q, He X, Jiang X, Li X (2020) Robust bi-stochastic graph regularized matrix factorization for data clustering. IEEE transactions on pattern analysis and machine intelligence White M, Yu Y, Zhang X, Schuurmans D (2012) Convex multi-view subspace learning. In: Nips. Lake Tahoe, Nevada, pp. 1682–1690 Wu Z, Liu S, Ding C, Ren Z, Xie S (2019) Learning graph similarity with large spectral gap. IEEE Trans Syst Man Cybern Xu Y-M, Wang C-D, Lai J-H (2016) Weighted multi-view clustering with feature selection. Pattern Recogn 53:25–35 Yang Z, Liang N, Yan W, Li Z, Xie S (2020) Uniform distribution non-negative matrix factorization for multiview clustering. IEEE Trans Cybern: 1–14 Yuan C, Zhu Y, Zhong Z, Zheng W, Zhu X (2022) Robust self-tuning multi-view clustering. World Wide Web 25(2):489–512 Zhan K, Zhang C, Guan J, Wang J (2017) Graph learning for multiview clustering. IEEE Trans Cybern 48(10):2887–2895 Zhan K, Nie F, Wang J, Yang Y (2019) Multiview consensus graph clustering. IEEE Trans Image Process 28(3):1261–1270 Zhang G-Y, Zhou Y-R, He X-Y, Wang C-D, Huang D (2020) One-step kernel multi-view subspace clustering. Knowl-Based Syst 189:105126 Zhong G, Pun C-M (2023) Self-taught multi-view spectral clustering. Pattern Recogn 138:109349 Zhu X, Zhang S, He W, Hu R, Lei C, Zhu P (2019) One-step multi-view spectral clustering. IEEE Trans Knowl Data Eng 31(10):2022–2034. https://doi.org/10.1109/TKDE.2018.2873378 Zhu X, Zhang S, Zhu Y, Zheng W, Yang Y (2020) Self-weighted multi-view fuzzy clustering. ACM Trans Knowl Discov Data 14(4):1–17 Zhu W, Nie F, Li X (2017) Fast spectral clustering with efficient large graph construction. In: 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp. 2492–2496