Prediction of Skin Disease with Three Different Feature Selection Techniques Using Stacking Ensemble Method
Tóm tắt
Skin disease is the most common problem between people. Due to pollution and deployment of ozone layer, harmful UV rays of sun burn the skin and develop various types of skin diseases. Nowadays, machine learning and deep learning algorithms are generally used for diagnosis for various kinds of diseases. In this study, we have applied three feature extraction techniques univariate feature selection, feature importance, and correlation matrix with heat map to find the optimum data subset of erythemato-squamous disease. Four classification techniques Gaussian Naïve Bayesian (NB), decision tree (DT), support vector machine (SVM), and random forest are used for measuring the performance of model. Stacking ensemble technique is then applied to enhance the prediction performance of the model. The proposed method used for measuring the performance of the model. It is finding that the optimal subset of the erythemato-squamous disease is performed well in the case of correlation and heat map feature selection techniques. The mean value, slandered deviation, root mean square error, kappa statistical error, and area under receiver operating characteristics and accuracy are calculated for demonstrating the effectiveness of the proposed model. The feature selection techniques applied with staking ensemble technique gives the better result as compared to individual machine learning techniques. The obtained results show that the performance of proposed model is higher than previous results obtained by researchers.
Tài liệu tham khảo
Güvenir, H. A., Demiröz, G., & Ilter, N. (1998). Learning differential diagnosis of erythemato-squamous diseases using voting feature intervals. Artificial Intelligence in Medicine, 13(3), 147–165. https://doi.org/10.1016/S0933-3657(98)00028-1.
Barati, E., Saraee, M., Mohammadi, A., Adibi, N., & Ahamadzadeh, M. R. (2011). A survey on utilization of data mining approaches for dermatological (skin) diseases prediction. Journal of Selected Areas in Health Informatics, 2(3), 1–11.
Ravichandran, K. S., Narayanamurthy, B., Ganapathy, G., Ravalli, S., & Sindhura, J. (2014). An efficient approach to an automatic detection of erythemato-squamous diseases. Neural Computing and Applications, 25(1), 105–114. https://doi.org/10.1007/s00521-013-1452-5.
Badrinath, N., Gopinath, G., Ravichandran, K. S., & Soundhar, R. G. (2016). Estimation of automatic detection of erythemato-squamous diseases through AdaBoost and its hybrid classifiers. Artificial Intelligence Review, 45(4), 471–488. https://doi.org/10.1007/s10462-015-9436-8.
Verma, A. K., Pal, S., & Kumar, S. (2019). Prediction of skin disease using ensemble data mining techniques and feature selection method—a comparative study. Applied Biochemistry and Biotechnology, 1–19. https://doi.org/10.1007/s12010-019-03093-z.
Xie, J., Lei, J., Xie, W., Shi, Y., & Liu, X. (2013). Two-stage hybrid feature selection algorithms for diagnosing erythemato-squamous diseases. Health Information Science and Systems, 1(1), 1–14. https://doi.org/10.1186/2047-2501-1-10.
Übeyli, E. D., & Doǧdu, E. (2010). Automatic detection of erythemato-squamous diseases using κ-means clustering. Journal of Medical Systems, 34(2), 179–184. https://doi.org/10.1007/s10916-008-9229-6.
Manjusha, K. K., Sankaranarayanan, K., & Seena, P. (2015). Data mining in dermatological diagnosis: a method for severity prediction. International Journal of Computer Applications, 117(11), 11–14. https://doi.org/10.5120/20597-3102.
Ozcift, A., & Gulten, A. (2012). A robust multi-class feature selection strategy based on rotation forest ensemble algorithm for diagnosis of erythemato-squamous diseases. Journal of Medical Systems, 36(2), 941–949. https://doi.org/10.1007/s10916-010-9558-0.
Giveki, D. (2012). Detection of erythemato-squamous diseases using AR-CatfishBPSO-KSVM. Signal & Image Processing : An International Journal, 2(4), 57–72. https://doi.org/10.5121/sipij.2011.2406.
Kabari, L. G., & Bakpo, F. S. (2009). Diagnosing skin diseases using an artificial neural network. ICAST 2009 - 2nd International Conference on Adaptive Science and Technology, 187–191. https://doi.org/10.1109/ICASTECH.2009.5409725.
Aruna, S., Nandakishore, V. L., & Rajagopalan, P. S. (2012). A hybrid feature selection method based on IGSBFS and naive Bayes for the diagnosis of erythemato-squamous diseases. International Journal of Computer Applications, 41(7), 13–18. https://doi.org/10.5120/5552-7623.
Almarabeh, H., & Ehab, F. E. (2017). A study of data mining techniques accuracy for healthcare. International Journal of Computer Applications, 168(3), 12–17. https://doi.org/10.5120/ijca2017914338.
Maryam, Setiawan, N. A., & Wahyunggoro, O. (2017). A hybrid feature selection method using multiclass SVM for diagnosis of erythemato-squamous disease. AIP Conference Proceedings, 1867(August). https://doi.org/10.1063/1.4994451.
Verma, A. K., Pal, S., & Kumar, S. (2019). Classification of skin disease using ensemble data mining techniques. Asian Pacific Journal of Cancer Prevention : APJCP, 20(6), 1887–1894. https://doi.org/10.31557/APJCP.2019.20.6.1887.
Chaurasia, V., & Pal, S. (2019). Skin diseases prediction: binary classification machine learning and multi model ensemble techniques. Research Journal of Pharmacy and Technology, 12(August), 3829–3832. https://doi.org/10.5958/0974-360X.2019.00656.5.
Wang, N., Xu, H. L., Zhao, X., Wen, X., Wang, F. T., Wang, S. Y., Fu, L. L., Liu, B., & Bao, J. K. (2012). Network-based identification of novel connections among apoptotic signaling pathways in cancer. Applied Biochemistry and Biotechnology, 167(3), 621–631. https://doi.org/10.1007/s12010-012-9704-x.
Banerjee, A. K., Ravi, V., Murty, U. S. N., Sengupta, N., & Karuna, B. (2013). Application of intelligent techniques for classification of bacteria using protein sequence-derived features. Applied Biochemistry and Biotechnology, 170(6), 1263–1281. https://doi.org/10.1007/s12010-013-0268-1.
Behbahani, M., Nosrati, M., & Moradi, M. (2019). Using Chou’s general pseudo amino acid composition to classify laccases from bacterial and fungal sources via Chou’s five-step rule. Applied Biochemistry and Biotechnology, 1–14. https://doi.org/10.1007/s12010-019-03141-8.
Polat, K., & Güneş, S. (2009). A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems. Expert Systems with Applications, 36(2), 1587–1592. https://doi.org/10.1016/J.ESWA.2007.11.051.
Übeyli, E. D. (2009). Combined neural networks for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 36(3), 5107–5112. https://doi.org/10.1016/J.ESWA.2008.06.002.
Chang, C. L., & Chen, C. H. (2009). Applying decision tree and neural network to increase quality of dermatologic diagnosis. Expert Systems with Applications, 36(2), 4035–4041. https://doi.org/10.1016/J.ESWA.2008.03.007.
Lekkas, S., & Mikhailov, L. (2010). Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological diseases. Artificial Intelligence in Medicine, 50(2), 117–126. https://doi.org/10.1016/J.ARTMED.2010.05.007.
Xie, J., & Wang, C. (2011). Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 38(5), 5809–5815. https://doi.org/10.1016/J.ESWA.2010.10.050.
Çataloluk, H., & Kesler, M. (2012). A diagnostic software tool for skin diseases with basic and weighted K-NN. INISTA 2012 - International Symposium on INnovations in Intelligent SysTems and Applications, 0–3. https://doi.org/10.1109/INISTA.2012.6246999.
Olatunji, S. O., & Arif, H. (2013). Identification of erythemato-squamous skin diseases using extreme learning machine and artificial neural network. ICTACT Journal on Soft Computing, 4(1), 627–632. https://doi.org/10.21917/ijsc.2013.0090.
Sharma, D., & Hota, H. (2013). Data mining techniques for prediction of different categories of dermatology diseases. Journal of Management Information and Decision Sciences, 16(2), 103.
Olatunji, S., & Arif, H. (2014). Identification of erythemato-squamous skin diseases using support vector machines and extreme learning machines: a comparative study towards effective diagnosis. Transactions on Machine Learning and Artificial Intelligence, 2(6). https://doi.org/10.14738/tmlai.26.812.
Amarathunga, A. A. L. C., Ellawala, E. P. W. C., Abeysekara, G. N., & Amalraj, C. R. J. (2015). Expert system for diagnosis of skin diseases. International Journal of Scientific & Technology Research, 4(1), 174–178.
Parikh, K. S., Shah, T. P., Kota, R. K., & Vora, R. (2015). Diagnosing common skin diseases using soft computing techniques. International Journal of Bio-Science and Bio-Technology, 7(6), 275–286. https://doi.org/10.14257/ijbsbt.2015.7.6.28.
Maghooli, K., Langarizadeh, M., Shahmoradi, L., Habibi-Koolaee, M., Jebraeily, M., & Bouraghi, H. (2016). Differential diagnosis of erythemato-squamous diseases using classification and regression tree. Acta informatica medica : AIM : journal of the Society for Medical Informatics of Bosnia & Herzegovina : casopis Drustva za medicinsku informatiku BiH, 24(5), 338–342. https://doi.org/10.5455/aim.2016.24.338-342.
Zhou, H., Xie, F., Jiang, Z., Liu, J., Wang, S., & Zhu, C. (2017). Multi-classification of skin diseases for dermoscopy images using deep learning. In 2017 IEEE International Conference on Imaging Systems and Techniques (IST) (pp. 1–5). IEEE. https://doi.org/10.1109/IST.2017.8261543.
Idoko, J. B., Arslan, M., & Abiyev, R. (2018). Fuzzy neural system application to differential diagnosis of erythemato-squamous diseases. Cyprus Journal of Medical Sciences, 90–97. https://doi.org/10.5152/cjms.2018.576.
Zhang, X., Wang, S., Liu, J., & Tao, C. (2018). Towards improving diagnosis of skin diseases by combining deep neural network and human knowledge. BMC Medical Informatics and Decision Making, 18(S2), 59. https://doi.org/10.1186/s12911-018-0631-9.