Appropriateness of performance indices for imbalanced data classification: An analysis

Pattern Recognition - Tập 102 - Trang 107197 - 2020
Sankha Subhra Mullick1, Shounak Datta2, Sourish Gunesh Dhekane3, Swagatam Das1
1Electronics and Communication Sciences Unit, Indian Statistical Institute, Kolkata, India
2Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA
3Department of Computer Science and Engineering, Indian Institute of Information Technology, Guwahati, India

Tài liệu tham khảo

Bishop, 2006 Rumelhart, 1986, Learning representations by back-propagating errors, Nature, 323, 533, 10.1038/323533a0 Sokolova, 2006, Beyond accuracy, f-score and ROC: a family of discriminant measures for performance evaluation, 1015 Das, 2018, Handling data irregularities in classification: foundations, trends, and future challenges, Pattern Recognit., 81, 674, 10.1016/j.patcog.2018.03.008 He, 2013 Haixiang, 2017, Learning from class-imbalanced data: Review of methods and applications, Expert Syst. Appl., 73, 220, 10.1016/j.eswa.2016.12.035 Branco, 2016, A survey of predictive modeling on imbalanced domains, ACM Comput. Surv., 49, 31, 10.1145/2907070 Japkowicz, 2006, Why question machine learning evaluation methods (an illustrative review of the shortcomings of current methods), 6 Buckland, 1994, The relationship between recall and precision, J. Am. Soc. Inf. Sci., 45, 12, 10.1002/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO;2-L Kubat, 1997, Addressing the curse of imbalanced training sets: one-sided selection, 97, 179 Hand, 2001, A simple generalisation of the area under the ROC curve for multiple class classification problems, Mach. Learn., 45, 171, 10.1023/A:1010920819831 Davis, 2006, The relationship between Precision-Recall and ROC curves, 233 Huang, 2016, Learning deep representation for imbalanced classification, 5375 N. Japkowicz, Assessment Metrics for Imbalanced Learning, John Wiley & Sons, Ltd, pp. 187–206. Daskalaki, 2006, Evaluation of classifiers for an uneven class distribution problem, Appl. Artif. Intell., 20, 381, 10.1080/08839510500313653 Ferri, 2009, An experimental comparison of performance measures for classification, Pattern Recognit. Lett., 30, 27, 10.1016/j.patrec.2008.08.010 Ballabio, 2018, Multivariate comparison of classification performance measures, Chemom. Intell. Lab. Syst., 174, 33, 10.1016/j.chemolab.2017.12.004 Joshi, 2002, On evaluating performance of classifiers for rare classes, 641 Liu, 2007, A framework for analyzing skew in evaluation metrics, 1 Sokolova, 2007, Performance measures in classification of human communications, 159 Sokolova, 2009, A systematic analysis of performance measures for classification tasks, Inf. Process. Manag., 45, 427, 10.1016/j.ipm.2009.03.002 Brzezinski, 2018, Visual-based analysis of classification measures and their properties for class imbalanced problems, Inf. Sci., 462, 242, 10.1016/j.ins.2018.06.020 Luque, 2019, The impact of class imbalance in classification performance metrics based on the binary confusion matrix, Pattern Recognit., 91, 216, 10.1016/j.patcog.2019.02.023 Núñez, 2017, Improving SVM classification on imbalanced datasets by introducing a new bias, J. Classif., 34, 427, 10.1007/s00357-017-9242-x Brzezinski, 2019, On the dynamics of classification measures for imbalanced and streaming data, IEEE Trans. Neural Netw. Learn. Syst., 1, 10.1109/TNNLS.2019.2899061 Yamazaki, 2007, Asymptotic bayesian generalization error when training and test distributions are different, 1079 Alaiz-Rodríguez, 2008, Assessing the impact of changing environments on classifier performance, 13 Rudd, 2018, The extreme value machine, IEEE Trans. Pattern Anal. Mach. Intell., 40, 762, 10.1109/TPAMI.2017.2707495 Bradley, 2006, Precision-recall operating characteristic (p-roc) curves in imprecise environments, 4, 123 F. Yu, Y. Zhang, S. Song, A. Seff, J. Xiao, LSUN: Construction of a large-scale image dataset using deep learning with humans in the loop, arXiv preprint arXiv:1506.03365 (2015). Deng, 2009, ImageNet: a large-scale hierarchical image database, 248 López, 2014, On the importance of the validation technique for classification with imbalanced datasets: Addressing covariate shift when data is skewed, Inf. Sci., 257, 1, 10.1016/j.ins.2013.09.038 Datta, 2018, Multiobjective support vector machines: Handling class imbalance with pareto optimality, IEEE Trans. Neural Netw. Learn. Syst., 30, 1602, 10.1109/TNNLS.2018.2869298 Szegedy, 2016, Rethinking the inception architecture for computer vision, 2818 Datta, 2019, Boosting with lexicographic programming: Addressing class imbalance without cost tuning, IEEE Trans. Knowl. Data Eng., 10.1109/TKDE.2019.2894148 Datta, 2015, Near-bayesian support vector machines for imbalanced data classification with equal or unequal misclassification costs, Neural Netw., 70, 39, 10.1016/j.neunet.2015.06.005 Seiffert, 2010, RUSBoost: a hybrid approach to alleviating class imbalance, IEEE Trans. Syst. Man Cybern. Part A: Syst. Hum., 40, 185, 10.1109/TSMCA.2009.2029559 Japkowicz, 2000, The class imbalance problem: Significance and strategies, 111 Breiman, 1984 Chawla, 2002, SMOTE: synthetic minority over-sampling technique, J. Artif. Intell. Res., 16, 321, 10.1613/jair.953 Madjarov, 2012, An extensive experimental comparison of methods for multi-label learning, Pattern Recognit., 45, 3084, 10.1016/j.patcog.2012.03.004 Carbonneau, 2018, Multiple instance learning: a survey of problem characteristics and applications, Pattern Recognit., 77, 329, 10.1016/j.patcog.2017.10.009 Lin, 2017, Focal loss for dense object detection, 2980 Zimbra, 2018, The state-of-the-art in twitter sentiment analysis: a review and benchmark evaluation, ACM Trans. Manag. Inf. Syst., 9, 5:1, 10.1145/3185045 Rudd, 2017, The extreme value machine, IEEE Trans. Pattern Anal. Mach. Intell., 40, 762, 10.1109/TPAMI.2017.2707495