Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting

Information Fusion - Tập 54 - Trang 128-144 - 2020
Jie Sun1, Hui Li2, Hamido Fujita3, Binbin Fu4, Wenguo Ai5
1School of Accountancy, Tianjin University of Finance and Economics, Tianjin, PR China
2College of Tourism and Service Management, Nankai University, Tianjin, PR China
3Faculty of Software and Information Science, Iwate Prefectural University, Iwate, Japan
4School of Economics and Management, Zhejiang Normal University, Jinhua, Zhejiang Province, PR China
5Management School, Harbin Institute of Technology, Harbin, Heilongjiang Province, PR China

Tài liệu tham khảo

Beaver, 1966, Financial ratios as predictors of failure, J. Account. Res., 4, S71, 10.2307/2490171 Altman, 1968, Financial ratios discriminant analysis and the prediction of corporate bankruptcy, J. Financ., 23, 589, 10.1111/j.1540-6261.1968.tb00843.x Ohlson, 1980, Financial ratio and the probabilistic prediction of bankruptcy, J. Account. Res., 18, 109, 10.2307/2490395 Frydman, 1985, Introducing recursive partitioning for financial classification: the case of financial distress, J. Financ., 40, 269, 10.1111/j.1540-6261.1985.tb04949.x Odom, 1990, Neural networks model for bankruptcy prediction, 2, 163 Ding, 2008, Forecasting financial condition of Chinese listed companies based on support vector machine, Expert Syst. Appl., 34, 3081, 10.1016/j.eswa.2007.06.037 Sun, 2006, FDP based on similarity weighted voting CBR, Lect. Notes Artif. Intell., 4093, 947 Zhang, 2010, Vertical bagging decision trees model for credit scoring, Expert Syst. Appl., 37, 7838, 10.1016/j.eswa.2010.04.054 Wang, 2011, A comparative assessment of ensemble learning for credit scoring, Expert Syst. Appl., 38, 223, 10.1016/j.eswa.2010.06.048 Wang, 2012, A hybrid ensemble approach for enterprise credit risk assessment based on support vector machine, Expert Syst. Appl., 39, 5325, 10.1016/j.eswa.2011.11.003 Sun, 2011, Dynamic financial distress prediction using instance selection for the disposal of concept drift, Expert Syst. Appl., 38, 2566, 10.1016/j.eswa.2010.08.046 Sun, 2013, Concept drift-oriented adaptive and dynamic support vector machine ensemble with time window in corporate financial risk prediction, IEEE Trans. Syst. Man Cybern., 43, 801, 10.1109/TSMCA.2012.2224338 Liu, 2017, Dynamic forecasting of financial distress: the hybrid use of incremental bagging and genetic algorithm—empirical study of Chinese listed corporations, Risk Manag., 19, 32, 10.1057/s41283-016-0012-6 Sun, 2017, Dynamic financial distress prediction with concept drift based on time weighting combined with Adaboost support vector machine ensemble, Knowl. Based Syst., 120, 4, 10.1016/j.knosys.2016.12.019 Crone, 2012, Instance sampling in credit scoring: an empirical study of sample size and balancing, Int. J. Forecast., 28, 224, 10.1016/j.ijforecast.2011.07.006 Sun, 2014, Imbalance-oriented SVM methods for financial distress prediction: a comparative study among the new SB-SVM-ensemble method and traditional methods, J. Oper. Res. Soc., 65, 1905, 10.1057/jors.2013.117 Brown, 2012, An experimental comparison of classification algorithms for imbalanced credit scoring data sets, Expert Syst. Appl., 39, 3446, 10.1016/j.eswa.2011.09.033 Yu, 2018, A DBN-based resampling SVM ensemble learning paradigm for credit classification with imbalanced data, Appl. Soft Comput., 69, 192, 10.1016/j.asoc.2018.04.049 Papouskova, 2019, Two-stage consumer credit risk modelling using heterogeneous ensemble learning, Decis. Support Syst., 118, 33, 10.1016/j.dss.2019.01.002 Varetto, 1998, Genetic algorithms applications in the analysis of insolvency risk, J. Bank. Financ., 22, 1421, 10.1016/S0378-4266(98)00059-4 Vapnik, 1998 Mselmi, 2017, Financial distress prediction: the case of French small and medium-sized firms, Int. Rev. Financ. Anal., 50, 67, 10.1016/j.irfa.2017.02.004 Barboza, 2017, Machine learning models and bankruptcy prediction, Expert Syst. Appl., 83, 405, 10.1016/j.eswa.2017.04.006 Zhou, 2017, One versus one multi-class classification fusion using optimizing decision directed acyclic graph for predicting listing status of companies, Inf. Fusion, 36, 80, 10.1016/j.inffus.2016.11.009 Holmes, 2017, Predicting financial distress and closure in rural hospitals, J. Rural Health, 33, 239, 10.1111/jrh.12187 Sayari, 2017, Industry specific financial distress modeling, Bus. Res. Q., 20, 45, 10.1016/j.brq.2016.03.003 Li, 2017, Predicting hospitality firm failure: mixed sample modelling, Int. J. Contemp. Hosp. Manag., 29, 1770, 10.1108/IJCHM-03-2015-0092 Li, 2019, Failure analysis of corporations with multiple hospitality businesses, Tour. Manag., 73, 21, 10.1016/j.tourman.2019.01.018 Fernández, 2018 Garcia, 2009, Learning from imbalanced data, IEEE Trans. Knowl. Data Eng., 21, 1263, 10.1109/TKDE.2008.239 Louzada, 2012, On the impact of disproportional samples in credit scoring models: an application to a Brazilian bank data, Expert Syst. Appl., 39, 8071, 10.1016/j.eswa.2012.01.134 Fernández, 2018, SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary, J. Artif. Intell. Res., 61, 863, 10.1613/jair.1.11192 Chawla, 2002, SMOTE: synthetic minority over-sampling technique, J. Artif. Intell. Res., 16, 321, 10.1613/jair.953 H. Li, J. Sun. Forecasting business failure: the use of nearest-neighbour support vectors and correcting imbalanced samples: evidence from the Chinese hotel industry. Tourism Tour. ManagementManag., 2012, 3(33): 622–634. Tang, 2009, SVMs modeling for highly imbalanced classification, IEEE Trans. Syst. Man Cybern., 39, 281, 10.1109/TSMCB.2008.2002909 Li, 2018, Restructuring performance prediction with a rebalanced and clustered support vector machine, J. Forecast., 37, 437, 10.1002/for.2512 Cordón, 2018, Imbalance: oversampling algorithms for imbalanced classification in R, Knowl. Based Syst., 161, 329, 10.1016/j.knosys.2018.07.035 Barua, 2014, MWMOTE–majority weighted minority oversampling technique for imbalanced data set learning, IEEE Trans. Knowl. Data Eng., 26, 405, 10.1109/TKDE.2012.232 Das, 2015, RACOG and wRACOG: two probabilistic over- sampling techniques, IEEE Trans. Knowl. Data Eng., 27, 222, 10.1109/TKDE.2014.2324567 Zhang, 2014, RWO-sampling: a random walk over-sampling approach to imbalanced data classification, Inf. Fusion, 20, 99, 10.1016/j.inffus.2013.12.003 Gao, 2014, PDFOS: PDF estimation based over-sampling for imbalanced two-class problems, Neurocomputing, 138, 248, 10.1016/j.neucom.2014.02.006 Cleofas-Sánchez, 2016, Financial distress prediction using the hybrid associative memory with translation, Appl. Soft Comput., 44, 144, 10.1016/j.asoc.2016.04.005 Raghuwanshi, 2018, Class-specific kernelized extreme learning machine for binary class imbalance learning, Appl. Soft Comput., 73, 1026, 10.1016/j.asoc.2018.10.011 Chawla, 2003, SMOTEBoost: improving prediction of the minority class in Boosting, 107 Gónzalez, 2017, Class switching according to nearest enemy distance for learning from highly imbalanced data-sets, Pattern Recognit., 70, 12, 10.1016/j.patcog.2017.04.028 García, 2018, Dynamic ensemble selection for multi-class imbalanced datasets, Inf. Sci., 445-446, 22, 10.1016/j.ins.2018.03.002 Zhang, 2018, DRCW-ASEG: one-versus-one distance-based relative competence weighting with adaptive synthetic example generation for multi-class imbalanced datasets, Neurocomputing, 285, 176, 10.1016/j.neucom.2018.01.039 Sun, 2018, Imbalanced enterprise credit evaluation with DTE-SBD: decision tree ensemble based on SMOTE and bagging with differentiated sampling rates, Inf. Sci., 425, 76, 10.1016/j.ins.2017.10.017 Schlimmer, 1986, Incremental learning from noisy data, Mach. Learn., 1, 317, 10.1007/BF00116895 Hoens, 2012, Learning from streaming data with concept drift and imbalance: an overview, Prog. Artif. Intell., 1, 89, 10.1007/s13748-011-0008-0 Mitchell, 1994, Experience with a learning personal assistant, Commun. ACM, 37, 81, 10.1145/176789.176798 Klinkenberg, 2004, Learning drifting concepts: example selection vs. example weighting, Intell. Data Anal., 3, 281, 10.3233/IDA-2004-8305 Krawczyk, 2017, Ensemble learning for data stream analysis: a survey, Inf. Fusion, 37, 132, 10.1016/j.inffus.2017.02.004 Wu, 2009, An adaptive ensemble classifiers for concept drifting stream, Proc. Comput. Intell. Data Min., 69 Almeida, 2018, Adapting dynamic classifier selection for concept drift, Expert Syst. Appl., 104, 67, 10.1016/j.eswa.2018.03.021 Ni, 2014, Research of multiple kernel SVM based on manifold learning in financial distress prediction, Syst. Eng. – Theory Pract., 34, 2666