Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy prediction

Information Fusion - Tập 47 - Trang 88-101 - 2019
Vicente García1, Ana I. Marqués2, J. Salvador Sánchez3
1Multidisciplinary University Division, Universidad Autónoma de Ciudad Juárez Ciudad Juárez, Chihuahua, Mexico
2Department of Business Administration and Marketing, Universitat Jaume I Castelló de la Plana, Spain
3Institute of New Imaging Technologies, Department of Computer Languages and Systems, Universitat Jaume I, Castelló de la Plana, Spain

Tài liệu tham khảo

Altman, 1968, Financial ratios, discriminant analysis and the prediction of corporate bankruptcy, J. Finance, 23, 589, 10.1111/j.1540-6261.1968.tb00843.x Ohlson, 1980, Financial ratios and the probabilistic prediction of bankruptcy, J. Account. Res., 18, 109, 10.2307/2490395 Zavgren, 1985, Assessing the vulnerability to failure of american industrial firms: a logistic analysis, J. Bus. Finance Account., 12, 19, 10.1111/j.1468-5957.1985.tb00077.x Allen, 2006, Financial survival analysis of defaulted debtors, J. Oper. Res. Soc., 57, 630, 10.1057/palgrave.jors.2602038 Lee, 2006, Mining the customer credit using classification and regression tree and multivariate adaptive regression splines, Comput. Stat. Data An., 50, 1113, 10.1016/j.csda.2004.11.006 Karels, 1987, Multivariate normality and forecasting of business bankruptcy, J. Bus. Finance Account., 14, 573, 10.1111/j.1468-5957.1987.tb00113.x Mihalovic, 2016, Performance comparison of multiple discriminant analysis and logit models in bankruptcy prediction, Econ. Sociol., 9, 101, 10.14254/2071-789X.2016/9-4/6 Shin, 2005, An application of support vector machines in bankruptcy prediction model, Expert Syst. Appl., 28, 127, 10.1016/j.eswa.2004.08.009 Ding, 2008, Forecasting financial condition of Chinese listed companies based on support vector machine, Expert Syst. Appl., 34, 3081, 10.1016/j.eswa.2007.06.037 Erdogan, 2013, Prediction of bankruptcy using support vector machines: an application to bank bankruptcy, J. Stat. Comput. Sim., 83, 1543, 10.1080/00949655.2012.666550 Kim, 2003, The discovery of experts’ decision rules from qualitative bankruptcy data using genetic algorithms, Expert Syst. Appl., 25, 637, 10.1016/S0957-4174(03)00102-7 Lensberg, 2006, Bankruptcy theory development and classification via genetic programming, Eur. J. Oper. Res., 169, 677, 10.1016/j.ejor.2004.06.013 Acosta-González, 2014, Forecasting financial failure of firms via genetic algorithms, Comput. Econ., 43, 133, 10.1007/s10614-013-9392-9 Atiya, 2001, Bankruptcy prediction for credit risk using neural networks: a survey and new results, IEEE T. Neural Networ., 12, 929, 10.1109/72.935101 Sun, 2007, Using Bayesian networks for bankruptcy prediction: some methodological issues, Eur. J. Oper. Res., 180, 738, 10.1016/j.ejor.2006.04.019 Khashman, 2011, Credit risk evaluation using neural networks: emotional versus conventional models, Appl. Soft Comput., 11, 5477, 10.1016/j.asoc.2011.05.011 Cleofas-Sánchez, 2016, Financial distress prediction using the hybrid associative memory with translation, Appl. Soft Comput., 44, 144, 10.1016/j.asoc.2016.04.005 Zhao, 2016, An effective computational model for bankruptcy prediction using kernel extreme learning machine approach, Comput. Econ., 1 Slowinski, 1995, Application of the rough set approach to evaluation of bankruptcy risk, Intell. Syst. Account. Finance Manag., 4, 27, 10.1002/j.1099-1174.1995.tb00078.x Mckee, 2000, Developing a bankruptcy prediction model via rough sets theory, Intell. Syst. Account. Finance Manag., 9, 159, 10.1002/1099-1174(200009)9:3<159::AID-ISAF184>3.0.CO;2-C Cabedo, 2015, Rough sets and discriminant analysis techniques for business default forecasting, Fuzzy Econ. Rev., 20, 3, 10.25102/fer.2015.01.01 Delen, 2013, Measuring firm performance using financial ratios: a decision tree approach, Expert Syst. Appl., 40, 3970, 10.1016/j.eswa.2013.01.012 Kim, 2014, Predicting restaurant financial distress using decision tree and AdaBoosted decision tree models, Econ. Model., 36, 354, 10.1016/j.econmod.2013.10.005 West, 2005, Neural network ensemble strategies for financial decision applications, Comput. Oper. Res., 32, 2543, 10.1016/j.cor.2004.03.017 Doumpos, 2007, Model combination for credit risk assessment: a stacked generalization approach, Ann. Oper. Res., 151, 289, 10.1007/s10479-006-0120-x Alfaro, 2008, Bankruptcy forecasting: An empirical comparison of AdaBoost and neural networks, Decis. Support Syst., 45, 110, 10.1016/j.dss.2007.12.002 Sun, 2012, Financial distress prediction using support vector machines: Ensemble vs. individual, Appl. Soft Comput., 12, 2254, 10.1016/j.asoc.2012.03.028 Pluto, 2006, Estimating probabilities of default for low default portfolios, 75 Twala, 2009, Combining classifiers for credit risk prediction, J. Sys. Scien. and Sys. Eng., 18, 292, 10.1007/s11518-009-5109-y Xiao, 2014, A transfer learning based classifier ensemble model for customer credit scoring, 64 Das, 2018, Handling data irregularities in classification: foundations, trends, and future challenges, Pattern Recogn., 81, 674, 10.1016/j.patcog.2018.03.008 Brown, 2012, An experimental comparison of classification algorithms for imbalanced credit scoring data sets, Expert Syst. Appl., 39, 3446, 10.1016/j.eswa.2011.09.033 Abellán, 2017, A comparative study on base classifiers in ensemble methods for credit scoring, Expert Syst. Appl., 73, 1, 10.1016/j.eswa.2016.12.020 Feng, 2018, Dynamic ensemble classification for credit scoring using soft probability, Appl. Soft Comput., 65, 139, 10.1016/j.asoc.2018.01.021 Xiao, 2012, Dynamic classifier ensemble model for customer classification with imbalanced class distribution, Expert Syst. Appl., 39, 3668, 10.1016/j.eswa.2011.09.059 He, 2018, A novel ensemble method for credit scoring: adaption of different imbalance ratios, Expert Syst. Appl., 98, 105, 10.1016/j.eswa.2018.01.012 Sun, 2018, Imbalanced enterprise credit evaluation with DTE-SBD: decision tree ensemble based on SMOTE and bagging with differentiated sampling rates, Inform. Sci., 425, 76, 10.1016/j.ins.2017.10.017 Wang, 2015, Large unbalanced credit scoring using Lasso-Logistic regression ensemble, PLoS ONE, 10, 1 Sun, 2014, Imbalance-oriented SVM methods for financial distress prediction: a comparative study among the new SB-SVM-ensemble method and traditional methods, J. Oper. Res. Soc., 65, 1905, 10.1057/jors.2013.117 Louzada, 2011, Poly-bagging predictors for classification modelling for credit scoring, Expert Syst. Appl., 38, 12717, 10.1016/j.eswa.2011.04.059 Xia, 2018, A novel heterogeneous ensemble credit scoring model based on bstacking approach, Expert Syst. Appl., 93, 182, 10.1016/j.eswa.2017.10.022 Yu, 2018, A dbn-based resampling SVM ensemble learning paradigm for credit classification with imbalanced data, Appl. Soft Comput., 69, 192, 10.1016/j.asoc.2018.04.049 Ala’raj, 2016, Classifiers consensus system approach for credit scoring, Knowl.-Based Syst., 104, 89, 10.1016/j.knosys.2016.04.013 Florez-Lopez, 2015, Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. a correlated-adjusted decision forest proposal, Expert Syst. Appl., 42, 5737, 10.1016/j.eswa.2015.02.042 Kim, 2015, Geometric mean based boosting algorithm with over-sampling to resolve data imbalance problem for bankruptcy prediction, Expert Syst. Appl., 42, 1074, 10.1016/j.eswa.2014.08.025 Ziba, 2016, Ensemble boosted trees with synthetic features generation in application to bankruptcy prediction, Expert Syst. Appl., 58, 93, 10.1016/j.eswa.2016.04.001 Li, 2012, Relevance vector machine based infinite decision agent ensemble learning for credit risk analysis, Expert Syst. Appl., 39, 4947, 10.1016/j.eswa.2011.10.022 Twala, 2010, Multiple classifier application to credit risk assessment, Expert Syst. Appl., 37, 3326, 10.1016/j.eswa.2009.10.018 M.A. Muslim, A. Nurzahputra, B. Prasetiyo, Improving accuracy of C4.5 algorithm using split feature reduction model and bagging ensemble for credit card risk prediction, in: International Conference on Information and Communications Technology, pp. 141–145. Xia, 2017, A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring, Expert Syst. Appl., 78, 225, 10.1016/j.eswa.2017.02.017 Koutanaei, 2015, A hybrid data mining model of feature selection algorithms and ensemble learning classifiers for credit scoring, J. Retailing Cons. Ser., 27, 11, 10.1016/j.jretconser.2015.07.003 Wang, 2014, An improved boosting based on feature selection for corporate bankruptcy prediction, Expert Syst. Appl., 41, 2353, 10.1016/j.eswa.2013.09.033 Ala’raj, 2016, A new hybrid ensemble credit scoring model based on classifiers consensus system approach, Expert Syst. Appl., 64, 36, 10.1016/j.eswa.2016.07.017 Liao, 2014, An ensemble-based model for two-class imbalanced financial problem, Econ. Model., 37, 175, 10.1016/j.econmod.2013.11.013 Wang, 2012, Two credit scoring models based on dual strategy ensemble trees, Knowl.-Based Syst., 26, 61, 10.1016/j.knosys.2011.06.020 Paleologo, 2010, Subagging for credit scoring models, Eur. J. Oper. Res., 201, 490, 10.1016/j.ejor.2009.03.008 Napierala, 2010, Learning from imbalanced data in presence of noisy and borderline examples, 158 Napierala, 2012, The influence of minority class distribution on learning from imbalance data, 139 Kubat, 1997, Addressing the curse of imbalanced training sets: one-sided selection, 179 Napierala, 2016, Types of minority class examples and their influence on learning classifiers from imbalanced data, J. Intell. Inf. Syst., 46, 563, 10.1007/s10844-015-0368-1 Sáez, 2016, Analyzing the oversampling of different classes and types of examples in multi-class imbalanced datasets, Pattern Recogn., 57, 164, 10.1016/j.patcog.2016.03.012 Krawczyk, 2014, Weighted one-class classification for different types of minority class examples in imbalanced data, 337 Skryjomski, 2017, Influence of minority class instance types on SMOTE imbalanced data oversampling, vol. 74, 7 Lichman, 2013 Jardin, 2007 Pietruszkiewicz, 2008, Dynamical systems and nonlinear Kalman filtering applied in classification, 263 Thomas, 2002 Yeh, 2009, The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients, Expert Syst. Appl., 36, 2473, 10.1016/j.eswa.2007.12.020 Sabzevari, 2007, A comparison between statistical and data mining methods for credit scoring in case of limited available data García, 2015, An insight into the experimental design for credit risk and corporate bankruptcy prediction systems, J. Intell. Inf. Syst., 44, 159, 10.1007/s10844-014-0333-4 Caouette, 2008 Breiman, 1996, Bagging predictors, Mach. Learn., 24, 123, 10.1007/BF00058655 Freund, 1996, Experiments with a new boosting algorithm, 148 Friedman, 2002, Stochastic gradient boosting, Comput. Stat. Data Anal., 38, 367, 10.1016/S0167-9473(01)00065-2 Ho, 1998, The random subspace method for constructing decision forests, IEEE Trans. Pattern Anal. Mach. Intell., 20, 832, 10.1109/34.709601 Rodríguez, 2006, Rotation forest: a new classifier ensemble method, IEEE Trans. Pattern Anal. Mach. Intell., 28, 1619, 10.1109/TPAMI.2006.211 Breiman, 2001, Random forests, Mach. Learn., 45, 5, 10.1023/A:1010933404324 Melville, 2005, Creating diversity in ensembles using artificial data, Inform. Fusion, 6, 99, 10.1016/j.inffus.2004.04.001 Stefanowski, 2016, Dealing with data difficulty factors while learning from imbalanced data, 333 Drummond, 2006, Cost curves: an improved method for visualizing classifier performance, Mach. Learn., 65, 95, 10.1007/s10994-006-8199-5 Lobo, 2008, AUC: a misleading measure of the performance of predictive distribution models, Global Ecol. Biogeogr., 17, 145, 10.1111/j.1466-8238.2007.00358.x Hand, 2009, Measuring classifier performance: a coherent alternative to the area under the ROC curve, Mach. Learn., 77, 103, 10.1007/s10994-009-5119-5 Finlay, 2011, Multiple classifier architectures and their application to credit risk assessment, Eur. J. Oper. Res., 210, 368, 10.1016/j.ejor.2010.09.029