Advanced hybrid ensemble gain ratio feature selection model using machine learning for enhanced disease risk prediction

Informatics in Medicine Unlocked - Tập 32 - Trang 101064 - 2022
Syed Javeed Pasha1, E. Syed Mohamed2
1Department of Computer Applications, School of Computer Information and Mathematical Sciences, B.S. Abdur Rahman Crescent Institute of Science and Technology, Chennai, India
2Department of Computer Science and Engineering, School of Computer Information and Mathematical Sciences, B.S. Abdur Rahman Crescent Institute of Science and Technology, Chennai, India

Tài liệu tham khảo

Brause, 2001, Medical analysis and diagnosis by neural networks, Lect Notes Comput Sci, 2199, 1, 10.1007/3-540-45497-7_1 Bishop, 2006 Liu, 2005, Toward integrating feature selection algorithms for classification and clustering, IEEE Trans Knowl Data Eng, 17, 491, 10.1109/TKDE.2005.66 Liu, 1998 Piramuthu, 2009, Iterative feature construction for improving inductive learning algorithms, Expert Syst Appl, 36, 3401, 10.1016/j.eswa.2008.02.010 Guyon, 2003, An introduction to variable and feature selection, J Mach Learn Res, 3, 1157 Pasha, 2020, Ensemble gain ratio feature selection (EGFS) model with machine learning and data mining algorithms for disease risk prediction, 590 Latha, 2019, Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques, Inform Med Unlocked, 16 Gupta, 2020, MIFH: a machine intelligence framework for heart disease diagnosis, IEEE Access, 8, 14659, 10.1109/ACCESS.2019.2962755 Robinson, 2017, The case of tomato in Ghana : processing, Eur J Hortic Sci, 5, 1 Pasha, 2020, Novel Feature Reduction (NFR) model with machine learning and data mining algorithms for effective disease risk prediction, IEEE Access, 8, 184087, 10.1109/ACCESS.2020.3028714 Rani, 2021, A decision support system for heart disease prediction based upon machine learning, J. Reliab. Intell. Environ., 7, 263, 10.1007/s40860-021-00133-6 Shah, 2017, Feature extraction through parallel Probabilistic Principal Component Analysis for heart disease diagnosis, Phys. A Stat. Mech. its Appl., 482, 796, 10.1016/j.physa.2017.04.113 Saqlain, 2019, Fisher score and Matthews correlation coefficient-based feature subset selection for heart disease diagnosis using support vector machines, Knowl Inf Syst, 58, 139, 10.1007/s10115-018-1185-y Amin, 2019, Identification of significant features and data mining techniques in predicting heart disease, Telematics Inf, 36, 82, 10.1016/j.tele.2018.11.007 Nagarajan, 2021, Innovative feature selection and classification model for heart disease prediction, J. Reliab. Intell. Environ., 10.1007/s40860-021-00152-3 Uddin, 2021, An ensemble method based multilayer dynamic system to predict cardiovascular disease using machine learning approach, Inform Med Unlocked, 24, 10.1016/j.imu.2021.100584 Pasha, 2019, Bio inspired ensemble feature selection (BEFS) model with machine learning and data mining algorithms for disease risk prediction Xu, 2015, Structural least square twin support vector machine for classification, Appl Intell, 42, 527, 10.1007/s10489-014-0611-4 Mokeddem, 2018, A fuzzy classification model for myocardial infarction risk assessment, Appl Intell, 48, 1233 Shilaskar, 2018, Diagnosis system for imbalanced multi-minority medical dataset, Soft Comput, 23, 1 Maji, 2019, Decision tree algorithms for prediction of heart disease, Inf. Commun. Technol. Compet. Strateg., 40, 447 Abdar, 2019, NE-nu-SVC: a new nested ensemble clinical decision support system for effective diagnosis of coronary artery disease, IEEE Access, 7, 167605, 10.1109/ACCESS.2019.2953920 Ali, 2019, An automated diagnostic system for heart disease prediction based on χ2 statistical model and optimally configured deep neural network, IEEE Access, 7, 34938, 10.1109/ACCESS.2019.2904800 Domor, 2020, Informatics in Medicine Unlocked an improved ensemble learning approach for the prediction of heart disease risk, Inform Med Unlocked, 20 Hajjam, 2020, vol. 19 Sarkar, 2020, Hybrid model for prediction of heart disease, Soft Comput, 24, 1903, 10.1007/s00500-019-04022-2 Perales-González, 2019, Regularized ensemble neural networks models in the Extreme Learning Machine framework, Neurocomputing, 361, 196, 10.1016/j.neucom.2019.06.040 Hameed, 2021, Efficient hybrid algorithm based on genetic with weighted fuzzy rule for developing a decision support system in prediction of heart diseases, J Supercomput, 77, 10117, 10.1007/s11227-021-03677-9 Yang, 2018, An improved Id3 algorithm for medical data classification, Comput Electr Eng, 65, 474, 10.1016/j.compeleceng.2017.08.005 Ding, 2019, Protein crystallization identification via fuzzy model on linear neighborhood representation, IEEE ACM Trans Comput Biol Bioinf, 5963 Hussain, 2021, An efficient hybrid sine-cosine Harris hawks optimization for low and high-dimensional feature selection, Expert Syst Appl, 176 Hossein, 2019, An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets, Nature, 1 Das, 2020, Biomedical data analysis using neuro-fuzzy model with post-feature reduction, J. King Saud Univ. - Comput. Inf. Sci., 1 Anooj, 2011, Clinical decision support system: risk level prediction of heart disease using weighted fuzzy rules and decision tree rules, Open Comput. Sci., 1, 27, 10.2478/s13537-011-0032-y Paul, 2018, Adaptive weighted fuzzy rule-based system for the risk level assessment of heart disease, Appl Intell, 48, 1739, 10.1007/s10489-017-1037-6 Sabahi, 2018, Bimodal fuzzy analytic hierarchy process (BFAHP) for coronary heart disease risk assessment, J Biomed Inf, 83, 204, 10.1016/j.jbi.2018.03.016 Reddy, 2019, Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis, Evol. Intell., 185 V Chawla, 2002 Han, 2011 Unler, 2011, Mr2PSO: a maximum relevance minimum redundancy feature selection method based on swarm intelligence for support vector machine classification, Inf Sci (Ny), 181, 4625, 10.1016/j.ins.2010.05.037 Huang, 2005, Using AUC and accuracy in evaluating learning algorithms, IEEE Trans Knowl Data Eng, 17, 299, 10.1109/TKDE.2005.50 Hosmer, 2000, vol. 1, 373 Breiman, 1993 Breiman, 2001, Random forests, Mach Learn, 45, 5, 10.1023/A:1010933404324 Schapire, 2003, The boosting approach to machine learning: an overview, 149 Friedman, 2002, Stochastic gradient boosting, Comput Stat Data Anal, 38, 367, 10.1016/S0167-9473(01)00065-2 Caruana, 2006, An empirical comparison of supervised learning algorithms, 161 Moisen, 2006 Elith, 2008, A working guide to boosted regression trees, J Anim Ecol, 77, 802, 10.1111/j.1365-2656.2008.01390.x Millet-Roig, 2000, Support vector machine for arrhythmia discrimination with wavelet transform-based feature selection, vol. 27, 407 Lin, 2008, Parameter determination of support vector machine and feature selection using simulated annealing approach, Appl Soft Comput J, 8, 1505, 10.1016/j.asoc.2007.10.012 Hall Witten, 2016 Quinlan, 1986, Induction of decision trees, Mach Learn, 1, 81, 10.1007/BF00116251 Alam, 2019, A Random Forest based predictor for medical data classification using feature ranking, Inform Med Unlocked, 15