Multiclass classification by Min–Max ECOC with Hamming distance optimization

The Visual Computer - Tập 39 - Trang 3949-3961 - 2022
Gábor Szűcs1
1Department of Telecommunications and Media Informatics, Budapest University of Technology and Economics, Budapest, Hungary

Tóm tắt

Two questions often arise in the field of the ensemble in multiclass classification problems, (i) how to combine base classifiers and (ii) how to design possible binary classifiers. Error-correcting output codes (ECOC) methods answer these questions, but they focused on only the general goodness of the classifier. The main purpose of our research was to strengthen the bottleneck of the ensemble method, i.e., to minimize the largest values of two types of error ratios in the deep neural network-based classifier. The research was theoretical and experimental, the proposed Min–Max ECOC method suggests a theoretically proven optimal solution, which was verified by experiments on image datasets. The optimal solution was based on the maximization of the lowest value in the Hamming matrix coming from the ECOC matrix. The largest ECOC matrix, the so-called full matrix is always a Min–Max ECOC matrix, but smaller matrices generally do not reach the optimal Hamming distance value, and a recursive construction algorithm was proposed to get closer to it. It is not easy to calculate optimal values for large ECOC matrices, but an interval with upper and lower limits was constructed by two theorems, and they were proved. Convolutional Neural Networks with Min–Max ECOC matrix were tested on four real datasets and compared with OVA (one versus all) and variants of ECOC methods in terms of known and two new indicators. The experimental results show that the suggested method surpasses the others, thus our method is promising in the ensemble learning literature.

Tài liệu tham khảo

Alvear-Sandoval, R.F., Sancho-Gómez, J.L., Figueiras-Vidal, A.R.: On improving CNNs performance: the case of MNIST. Inf. Fusion 52, 106–109 (2019). https://doi.org/10.1016/j.inffus.2018.12.005

Chaladze, G., Kalatozishvili, L.: Linnaeus 5 dataset for machine learning (2017)

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp. 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90

Lei, L., Song, Y.: Weighted decoding for the competence reliability problem of ECOC multiclass classification. Comput. Intell. Neurosci. 2021, Article ID 5583031, 11 pp (2021). https://doi.org/10.1155/2021/5583031

Mehra, N., Gupta, S.: Survey on multiclass classification methods. Int. J. Comput. Sci. Inf. Technol. (IJCSIT) 4(4), 572–576 (2013)

Zou, J.Y., Sun, M.X., Liu, K.H., Wu, Q.Q.: The design of dynamic ensemble selection strategy for the error-correcting output codes family. Inf. Sci. (2021). https://doi.org/10.1016/j.ins.2021.04.038