A software defect prediction method based on learnable three-line hybrid feature fusion

Expert Systems with Applications - Tập 239 - Trang 122409 - 2024
Yu Tang1,2, Qi Dai3, Ye Du1,2,4, Lifang Chen5,6, Xuanwen Niu1,2
1School of Computer and Information Technology, Beijing Jiaotong University, Beijing, 100000, China
2Beijing Key Laboratory of Security and Privacy in Intelligent Transportation, Beijing Jiaotong University, Beijing, 100000, China
3Department of Automation, College of Information Science and Engineering, China University of Petroleum, Beijing, Beijing, China
4Beijing Laboratory of National Economic Security Early-warning Engineering, Beijing Jiaotong University, Beijing 100000, China
5College of Science, North China University of Science and Technology, Hebei Tangshan 063210, China
6Hebei Key Laboratory of Data Science and Application, Hebei Tangshan 063210, China

Tài liệu tham khảo

Pandey, 2021, Machine learning based methods for software fault prediction: A survey, Expert Systems with Applications, 172, 114595, 10.1016/j.eswa.2021.114595 Rathi, 2023, Empirical evaluation of the performance of data sampling and feature selection techniques for software fault prediction, Expert Systems with Applications, 223, 119806, 10.1016/j.eswa.2023.119806 Pachouly, 2022, A systematic literature review on software defect prediction using artificial intelligence: Datasets, Data Validation Methods, Approaches, and Tools, Engineering Applications of Artificial Intelligence, 111, 104773, 10.1016/j.engappai.2022.104773 Kanwar, 2023, Candidate project selection in cross project defect prediction using hybrid method, Expert Systems with Applications, 218, 119625, 10.1016/j.eswa.2023.119625 Giray, 2023, On the use of deep learning in software defect prediction, Journal of Systems and Software, 195, 111537, 10.1016/j.jss.2022.111537 Jiang, 2022, A random approximate reduct-based ensemble learning approach and its application in software defect prediction, Information Sciences, 609, 1147, 10.1016/j.ins.2022.07.130 Sharma, 2023, Ensemble Machine Learning Paradigms in Software Defect Prediction, Procedia Computer Science, 218, 199, 10.1016/j.procs.2023.01.002 Nevendra, 2022, Empirical investigation of hyperparameter optimization for software defect count prediction, Expert Systems with Applications, 191, 116217, 10.1016/j.eswa.2021.116217 Zheng, 2022, Interpretability application of the Just-in-Time software defect prediction model, Journal of Systems and Software, 188, 111245, 10.1016/j.jss.2022.111245 Zhang, 2023, Mechanical fault intelligent diagnosis using attention-based dual-scale feature fusion capsule network, Measurement, 207, 112345, 10.1016/j.measurement.2022.112345 Jiang, 2023, Trimap-guided feature mining and fusion network for natural image matting, Computer Vision and Image Understanding, 230, 103645, 10.1016/j.cviu.2023.103645 Ni, 2023, A feature aggregation and feature fusion network for retinal vessel segmentation, Biomedical Signal Processing and Control, 85, 104829, 10.1016/j.bspc.2023.104829 Tang, 2023, Software defect prediction ensemble learning algorithm based on adaptive variable sparrow search algorithm, International Journal of Machine Learning and Cybernetics, 1 Pramanik, 2023, Breast cancer detection in thermograms using a hybrid of GA and GWO based deep feature selection method, Expert Systems with Applications, 219, 119643, 10.1016/j.eswa.2023.119643 Qu, 2023, Explicit and size-adaptive PSO-based feature selection for classification, Swarm and Evolutionary Computation, 77, 101249, 10.1016/j.swevo.2023.101249 Nadimi, 2022, Enhanced whale optimization algorithm for medical feature selection: A COVID-19 case study, Computers in Biology and Medicine, 148, 105858, 10.1016/j.compbiomed.2022.105858 Xue, 2016, IBED: Combining IBEA and DE for optimal feature selection in software product line engineering, Applied Soft Computing, 49, 1215, 10.1016/j.asoc.2016.07.040 Tubishat, 2021, Dynamic salp swarm algorithm for feature selection, Expert Systems with Applications, 164, 113873, 10.1016/j.eswa.2020.113873 Ma, 2022, Multi-strategy ensemble binary hunger games search for feature selection, Knowledge-Based Systems, 248, 108787, 10.1016/j.knosys.2022.108787 Karpagalingam, 2022, A feature selection model for software defect prediction using binary Rao optimization algorithm, Applied Soft Computing, 131, 109737, 10.1016/j.asoc.2022.109737 Zhu, 2021, Software defect prediction based on enhanced metaheuristic feature selection optimization and a hybrid deep neural network, Journal of Systems and Software, 180, 111026, 10.1016/j.jss.2021.111026 Ni, 2019, An empirical study on pareto based multi-objective feature selection for software defect prediction, Journal of Systems and Software, 152, 215, 10.1016/j.jss.2019.03.012 Li, 2020, A systematic review of unsupervised learning techniques for software defect prediction, Information and Software Technology, 122, 106287, 10.1016/j.infsof.2020.106287 Zhang, 2023, Development of a cross-scale weighted feature fusion network for hot-rolled steel surface defect detection, Engineering Applications of Artificial Intelligence, 117, 105628, 10.1016/j.engappai.2022.105628 Guan, 2023, Defect classification for specular surfaces based on deflectometry and multi-modal fusion network, Optics and Lasers in Engineering, 163, 107488, 10.1016/j.optlaseng.2023.107488 Li, 2021, Sewer pipe defect detection via deep learning with local and global feature fusion, Automation in Construction, 129, 103823, 10.1016/j.autcon.2021.103823 Luo, 2016, Nonlinear process monitoring based on kernel global-local preserving projections, Journal of Process Control, 38, 11, 10.1016/j.jprocont.2015.12.005 Kong, 2021, Surface roughness prediction using kernel locality preserving projection and Bayesian linear regression, Mechanical Systems and Signal Processing, 152, 107474, 10.1016/j.ymssp.2020.107474 Yang, 2022, A novel vibro-acoustic fault diagnosis method of rolling bearings via entropy-weighted nuisance attribute projection and orthogonal locality preserving projections under various operating conditions, Applied Acoustics, 196, 108889, 10.1016/j.apacoust.2022.108889 Tu, 2023, A coupling model of multi-feature fusion and multi-machine learning model integration for defect recognition, Journal of Magnetism and Magnetic Materials, 568, 170395, 10.1016/j.jmmm.2023.170395 Wang, 2021, Fault feature extraction of rolling bearing integrating KPCA and t-SNE, Journal of Vibration Engineering, 34, 431 Sun, 2019, Machine health monitoring based on locally linear embedding with kernel sparse representation for neighborhood optimization, Mechanical Systems and Signal Processing, 114, 25, 10.1016/j.ymssp.2018.04.044 Peng, 2022, Fault detection and quantitative assessment method for process industry based on feature fusion, Measurement, 197, 111267, 10.1016/j.measurement.2022.111267 Li, 2023, Automatic classification of interactive texts in online collaborative discussion based on multi-feature fusion, Computers and Electrical Engineering, 107, 108648, 10.1016/j.compeleceng.2023.108648 Su, 2023, Accurate polyp segmentation through enhancing feature fusion and boosting boundary performance, Neurocomputing, 545, 126233, 10.1016/j.neucom.2023.126233 Yang, 2023, Cross-subject classification of depression by using multiparadigm EEG feature fusion, Computer Methods and Programs in Biomedicine, 233, 107360, 10.1016/j.cmpb.2023.107360 Zhou, 2019, Software defect prediction model based on KPCA-SVM, 1326 Xu, 2019, Software defect prediction based on kernel PCA and weighted extreme learning machine, Information and Software Technology, 106, 182, 10.1016/j.infsof.2018.10.004 Zhang, 2020, KAEA: A Novel Three-stage Ensemble Model for Software Defect Prediction, Computers, Materials and Continua, 64, 471, 10.32604/cmc.2020.010117 Liu, 2021, Bearing feature extraction using multi-structure locally linear embedding, Neurocomputing, 428, 280, 10.1016/j.neucom.2020.11.048 Lu, 2022, Natural gas pipeline leak diagnosis based on improved variational modal decomposition and locally linear embedding feature extraction method, Process Safety and Environmental Protection, 164, 857, 10.1016/j.psep.2022.05.043 Ma, 2022, Automated screening of COVID-19 using two-dimensional variational mode decomposition and locally linear embedding, Biomedical Signal Processing and Control, 78, 103889, 10.1016/j.bspc.2022.103889 Nadal, 2019, Evaluation of hip fracture risk using a hyper-parametric model based on the Locally Linear Embedding technique, Comptes Rendus Mécanique, 347, 856, 10.1016/j.crme.2019.11.010 Xue, 2020, A novel swarm intelligence optimization approach: Sparrow search algorithm, Systems Science & Control Engineering, 8, 22, 10.1080/21642583.2019.1708830 Scholkopf, 1997, Kernel Principal Component Analysis, 583 Roweis, 2000, Nonlinear dimensionality reduction by locally linear embedding, Science, 290, 2323, 10.1126/science.290.5500.2323 Mushava, 2022, A novel XGBoost extension for credit scoring class-imbalanced data combining a generalized extreme value link and a modified focal loss function, Expert Systems with Applications, 202, 117233, 10.1016/j.eswa.2022.117233 Yuan, 1966, A preliminary report on male sterility in rice, Chinese Science Bulletin, 4, 185 Dai, 2022, Class-imbalanced positive instances augmentation via three-line hybrid, Knowledge-Based Systems, 257, 109902, 10.1016/j.knosys.2022.109902 Liu, 2022, A comparative study on the effect of data imbalance on software defect prediction, Procedia Computer Science, 214, 1603, 10.1016/j.procs.2022.11.349 Feng, 2021, COSTE: Complexity-based OverSampling TEchnique to alleviate the class imbalance problem in software defect prediction, Information and Software Technology, 129, 106432, 10.1016/j.infsof.2020.106432 Zheng, 2021, Training data selection for imbalanced cross-project defect prediction, Computers & Electrical Engineering, 94, 107370, 10.1016/j.compeleceng.2021.107370 Pandey, 2023, Software Fault Prediction for Imbalanced Data: A Survey on Recent Developments, Procedia Computer Science, 218, 1815, 10.1016/j.procs.2023.01.159 José, 2015, SMOTE–IPF: Addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering, Information Sciences, 291, 184, 10.1016/j.ins.2014.08.051 György, 2019, An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets, Applied Soft Computing, 83, 105662, 10.1016/j.asoc.2019.105662 Ziegelmeier, 2017, Sparse locally linear embedding, Procedia Computer Science, 108, 635, 10.1016/j.procs.2017.05.171 Mohammed, M. Z., & Saleh, I. A. (2022). Predicted of Software Fault Based on Random Forest and K-Nearest Neighbor. 2022 4th International Conference on Advanced Science and Engineering, 43-48. Shen, Y., Hu, S., Cai, S., & Chen, M. (2022). Software Defect Prediction based on Bayesian Optimization Random Forest. // 2022 9th International Conference on Dependable Systems and Their Applications, 1012-1013. Zhang, 2023, Multi-View Fusion Neural Network for Traffic Demand Prediction, Information Sciences, 119303 Tong, 2022, SHSE: A subspace hybrid sampling ensemble method for software defect number prediction, Information and Software Technology, 142, 106747, 10.1016/j.infsof.2021.106747 Jin, 2021, Cross-project software defect prediction based on domain adaptation learning and optimization, Expert Systems with Applications, 171, 114637, 10.1016/j.eswa.2021.114637 Okumoto, 2022, Early Software Defect Prediction: Right-Shifting Software Effort Data into a Defect Curve, IEEE International Symposium on Software Reliability Engineering Workshops, 2022, 43 Ran, Y., Xu, X. M., & Xu, Z. W. (2022). Research and Application of Software Defect Prediction Model based on Data Mining. // 2022 IEEE International Conference on Sensing, Diagnostics, Prognostics, and Control, 367-371. Xi, L., Lu, H. F., & Xu, Y. X. (2020). Intelligent radar software defect prediction approach and its application. 2020 IEEE 20th International Conference on Software Quality, Reliability and Security Companion, 32-37. García, 2012, Evolutionary-based selection of generalized instances for imbalanced classification, Knowledge-Based Systems, 25, 3, 10.1016/j.knosys.2011.01.012