2-stage modified random forest model for credit risk assessment of P2P network lending to “Three Rurals” borrowers

Applied Soft Computing - Volume 95 - Page 106570 - 2020
Congjun Rao (1), Ming Liu (1), Mark Goh (2), Jianghui Wen (1)
(1) School of Science, Wuhan University of Technology, Wuhan 430070, PR China
(2) NUS Business School & The Logistics Institute-Asia Pacific, National University of Singapore, 119623, Singapore

Abstract

Keywords

References

Gao, 2018, The performance of the P2P finance industry in China, Electron. Commer. Res. Appl., 30, 138, 10.1016/j.elerap.2018.06.002

Liu, 2019, Platform competition in peer-to-peer lending considering risk control ability, European J. Oper. Res., 274, 280, 10.1016/j.ejor.2018.09.024

Wiginton, 1980, A note on the comparison of logit and discriminant models of consumer credit behavior, J. Financ. Quant. Anal., 15, 757, 10.2307/2330408

Wang, 2016, Probabilistic framework of visual anomaly detection for unbalanced data, Neurocomputing, 201, 12, 10.1016/j.neucom.2016.03.038

Chawla, 2002, SMOTE: synthetic minority over-sampling technique, J. Artificial Intelligence Res., 16, 321, 10.1613/jair.953

Niu, 2020, Resampling ensemble model based on data distribution for imbalanced credit risk evaluation in P2P lending, Inform. Sci., 536, 120, 10.1016/j.ins.2020.05.040

Xie, 2009, Customer churn prediction using improved balanced random forests, Expert Syst. Appl., 36, 5445, 10.1016/j.eswa.2008.06.121

Mo, 2010, A two-stage clustering approach for multi-region segmentation, Expert Syst. Appl., 37, 7120, 10.1016/j.eswa.2010.03.003

Wang, 2018, A novel behavioral scoring model for estimating probability of default over time in peer-to-peer lending, Electron. Commer. Res. Appl., 27, 74, 10.1016/j.elerap.2017.12.006

Mercadier, 2019, Credit spread approximation and improvement using random forest regression, European J. Oper. Res., 277, 351, 10.1016/j.ejor.2019.02.005

Chen, 2018, Structured random forest for label distribution learning, Neurocomputing, 320, 171, 10.1016/j.neucom.2018.09.002

Zerbini, 2019, Wavelet against random forest for anomaly mitigation in software-defined networking, Appl. Soft Comput., 80, 138, 10.1016/j.asoc.2019.02.046

Xia, 2017, Cost-sensitive boosted tree for loan evaluation in peer-to-peer lending, Electron. Commer. Res. Appl., 24, 30, 10.1016/j.elerap.2017.06.004

Yang, 2018, Classification algorithm of unbalanced data based on cost-sensitive random forest, Sci. Technol. Eng., 18, 285

Ju, 2018, Research on the evaluation mechanism of personal credit in the Internet era: a case study of Sesame Credit, Mod. Manag. Sci., 302, 111

Ye, 2018, Loan evaluation in P2P lending based on random forest optimized by genetic algorithm with profit score, Electron. Commer. Res. Appl., 32, 23, 10.1016/j.elerap.2018.10.004

Dorfleitner, 2016, Description-text related soft information in peer-to-peer lending: evidence from two leading European platforms, J. Bank. Financ., 64, 169, 10.1016/j.jbankfin.2015.11.009

Rao, 2019, Design of comprehensive evaluation index system for P2P credit risk of Three Rural borrowers, Soft Comput.

Zhu, 2005, Cost-constrained data acquisition for intelligent data preparation, IEEE Trans. Knowl. Data Eng., 17, 1542, 10.1109/TKDE.2005.176

Tan, 1993, Cost-sensitive learning of classification knowledge and its applications in robotics, Mach. Learn., 13, 7, 10.1007/BF00993101

Kamel, 2007, Cost-sensitive boosting for classification of imbalanced data, Pattern Recognit., 40, 3358, 10.1016/j.patcog.2007.04.009

Krawczyk, 2014, Cost-sensitive decision tree ensembles for effective imbalanced classification, Appl. Soft Comput., 14, 554, 10.1016/j.asoc.2013.08.014

Tapkan, 2016, A cost-sensitive classification algorithm: BEE-miner, Knowl.-Based Syst., 95, 99, 10.1016/j.knosys.2015.12.010

Tao, 2019, Self-adaptive cost weights-based support vector machine cost-sensitive ensemble for imbalanced data classification, Inform. Sci., 487, 31, 10.1016/j.ins.2019.02.062

Castro, 2013, Novel cost-sensitive approach to improve the multilayer perceptron performance on imbalanced data, IEEE Trans. Neural Netw. Learn. Syst., 24, 888, 10.1109/TNNLS.2013.2246188

Mu, 2018, A Pearson’s correlation coefficient based decision tree and its parallel implementation, Inform. Sci., 435, 40, 10.1016/j.ins.2017.12.059

Jadhav, 2018, Information gain directed genetic algorithm wrapper feature selection for credit rating, Appl. Soft Comput., 69, 541, 10.1016/j.asoc.2018.04.033

Turney, 2000, Types of cost in inductive concept learning, 15

Lee, 2012, A novel algorithm applied to classify unbalanced data, Appl. Soft Comput., 12, 2481, 10.1016/j.asoc.2012.03.051

Li, 2018, Cost-sensitive and hybrid-attribute measure multi-decision tree over imbalanced data sets, Inform. Sci., 422, 242, 10.1016/j.ins.2017.09.013

Yang, 2019, Improved cost-sensitive random forest for imbalanced classification, J. Comput., 30, 213

Pu, 2019, Mountain railway alignment optimization using stepwise & hybrid particle swarm optimization incorporating genetic operators, Appl. Soft Comput., 78, 41, 10.1016/j.asoc.2019.01.051

Pedemonte, 2018, A theoretical and empirical study of the trajectories of solutions on the grid of Systolic Genetic Search, Inform. Sci., 445–446, 97, 10.1016/j.ins.2018.02.033

Xiao, 2020, A novel car-following inertia gray model and its application in forecasting short-term traffic flow, Appl. Math. Model., 87, 546, 10.1016/j.apm.2020.06.020

Fayed, 2019, Speed up grid-search for parameter selection of support vector machines, Appl. Soft Comput., 80, 202, 10.1016/j.asoc.2019.03.037

Mao, 2020, Grey Lotka–Volterra model for the competition and cooperation between third-party online payment systems and online banking in China, Appl. Soft Comput., 10.1016/j.asoc.2020.106501

Rao, 2020, Study on the interactive influence between economic growth and environmental pollution, Environ. Sci. Pollut. Res., 10.1007/s11356-020-10017-6

Ji, 2020, A fuzzy-robust weighted approach for multicriteria bilevel games, IEEE Trans. Ind. Inf., 16, 5369, 10.1109/TII.2020.2969456

Jayadeva, 2019, Twin neural networks for the classification of large unbalanced data sets, Neurocomputing, 343, 34, 10.1016/j.neucom.2018.07.089