A new deep learning ensemble credit risk evaluation model with an improved synthetic minority oversampling technique

Applied Soft Computing - Tập 98 - Trang 106852 - 2021
Feng Shen1,2, Xingchao Zhao1, Gang Kou3, Fawaz E. Alsaadi4
1School of Finance, Southwestern University of Finance and Economics, Chengdu, 611130, PR China
2Fintech Innovation Center, Southwestern University of Finance and Economics, Chengdu 611130, PR China
3School of Business Administration, Southwestern University of Finance and Economics, Chengdu 611130, PR China
4Department of Information Technology, Faculty of Computing and IT, King Abdulaziz University, Jeddah, Saudi Arabia

Tài liệu tham khảo

Dastile, 2020, Statistical and machine learning models in credit scoring: A systematic literature survey, Appl. Soft Comput. J., 10.1016/j.asoc.2020.106263 Mihalovic, 2016, Performance comparison of multiple discriminant analysis and logit models in bankruptcy prediction, Econ. Sociol., 9, 101, 10.14254/2071-789X.2016/9-4/6 Huang, 2018, Enterprise credit risk evaluation based on neural network algorithm, Cogn. Syst. Res., 52, 317, 10.1016/j.cogsys.2018.07.023 Wu, 2019, Feature construction for fraudulent credit card cash-out detection, Decis. Support Syst., 127, 10.1016/j.dss.2019.113155 Sun, 2018, Imbalanced enterprise credit evaluation with DTE-SBD: Decision tree ensemble based on SMOTE and bagging with differentiated sampling rates, Inform. Sci., 425, 76, 10.1016/j.ins.2017.10.017 Lessmann, 2015, Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research, European J. Oper. Res., 247, 124, 10.1016/j.ejor.2015.05.030 Pławiak, 2019, Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring, Appl. Soft Comput., 84, 10.1016/j.asoc.2019.105740 Kim, 2019, An ensemble semi-supervised learning method for predicting defaults in social lending, Eng. Appl. Artif. Intell., 81, 193, 10.1016/j.engappai.2019.02.014 Sezer, 2020, Financial time series forecasting with deep learning: A systematic literature review: 2005–2019, Appl. Soft Comput. J., 10.1016/j.asoc.2020.106181 Yishen, 2017, Credit risk assessment based on long short-term memory model, Intell. Comput. Theor. Appl., 70, 0 Wang, 2019, A deep learning approach for credit scoring of Peer-to-Peer lending using attention mechanism LSTM, IEEE Access, 7, 2161, 10.1109/ACCESS.2018.2887138 Gupta, 2019, Deep learning research: Scientometric assessment of Global Publications Output during 2004-17, Emerg. Sci. J., 3, 23, 10.28991/esj-2019-01165 Munkhdalai, 2020, GEV-NN: A deep neural network architecture for class imbalance problem in binary classification, Knowl.-Based Syst., 10.1016/j.knosys.2020.105534 Yu, 2018, A DBN-based resampling SVM ensemble learning paradigm for credit classification with imbalanced data, Appl. Soft Comput., 69, 192, 10.1016/j.asoc.2018.04.049 Crone, 2012, Instance sampling in credit scoring: an empirical study of sample size and balancing, Int. J. Forecast., 28, 224, 10.1016/j.ijforecast.2011.07.006 Krawczyk, 2016, Learning from imbalanced data: open challenges and future directions, Prog. Artif. Intell., 5, 221, 10.1007/s13748-016-0094-0 Chawla, 2002, SMOTE: synthetic minority over-sampling technique, J. Artificial Intelligence Res., 16, 321, 10.1613/jair.953 Abdi, 2015, To combat multi-class imbalanced problems by means of over-sampling techniques, IEEE Trans. Knowl. Data Eng., 28, 238, 10.1109/TKDE.2015.2458858 Xia, 2017, Cost-sensitive boosted tree for loan evaluation in peer-to-peer lending, Electron. Commer. Res. Appl., 24, 30, 10.1016/j.elerap.2017.06.004 Czajkowskia, 2015, Cost-sensitive Global Model Trees applied to loan charge-off forecasting, Decis. Support Syst., 74, 57, 10.1016/j.dss.2015.03.009 Galar, 2012, A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches, IEEE Trans. Syst. Man Cybern. C (Appl. Rev.), 42, 463, 10.1109/TSMCC.2011.2161285 Kim, 2015, Geometric mean based boosting algorithm with oversampling to resolve data imbalance problem for bankruptcy prediction, Expert Syst. Appl., 42, 1074, 10.1016/j.eswa.2014.08.025 Song, 2019, A MCDM-based evaluation approach for imbalanced classification methods in financial risk prediction, IEEE Access, 7, 84897, 10.1109/ACCESS.2019.2924923 Chen, 2010, Ramoboost: ranked minority oversampling in boosting, IEEE Trans. Neural Netw., 21, 1624, 10.1109/TNN.2010.2066988 González-Carratoa, 2018, Wind farm monitoring using mahalanobis distance and fuzzy clustering, Renew. Energy Hochreiter, 1997, Long short-term memory, Neural Comput., 9, 1735, 10.1162/neco.1997.9.8.1735 Kim, 2018, Forecasting the volatility of stock price index: A hybrid model integrating LSTM with multiple GARCH-type models, Expert Syst. Appl., 103, 25, 10.1016/j.eswa.2018.03.002 Gers, 2002, Learning precise timing with LSTM recurrent networks, J. Mach. Learn. Res., 3, 115 Jones, 2015, An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes, J. Bank. Financ., 56, 72, 10.1016/j.jbankfin.2015.02.006 A. Asuncion, D. Newman, UCI Machine Learning Repository, 2007 Irvine, CA: University of California, School of Information and Computer Science. http://archive.ics.uci.edu/ml. Pedregosa, 2011, Scikit-learn: Machine learning in Python, J. Mach. Learn. Res., 12, 2825 Cawley, 2010, On over-fitting in model selection and subsequent selection Bias in Performance Evaluation, J. Mach. Learn. Res., 11, 2079 Robinson, 2020, Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction, J. Comput. Aided Mol. Des., 10.1007/s10822-019-00274-0 Shi, 2019, Variable selection and validation in multivariate modelling, Bioinformatics, 35, 972, 10.1093/bioinformatics/bty710 Haixiang, 2017, Learning from class-imbalanced data: Review of methods and applications, Expert Syst. Appl., 73, 220, 10.1016/j.eswa.2016.12.035 Carta, 2020, A combined entropy-based approach for a proactive credit scoring, Eng. Appl. Artif. Intell., 87, 10.1016/j.engappai.2019.103292 Teply, 2020, Best classification algorithms in peer-to-peer lending, N. Am. J. Econ. Finance, 51, 10.1016/j.najef.2019.01.001 Garcia, 2015, An insight into the experimental design for credit risk and corporate bankruptcy prediction systems, J. Intell. Inf. Syst., 44, 159, 10.1007/s10844-014-0333-4 Tsaih, 2004, Credit scoring system for small business loans, Decis. Support Syst., 38, 91, 10.1016/S0167-9236(03)00079-4 Worku, 2019, The uptake of E-Commerce services in johannesburg, Civ. Eng. J., 5, 349, 10.28991/cej-2019-03091250