A robust and consistent stack generalized ensemble-learning framework for image segmentation
Tóm tắt
In the present study, we aim to propose an effective and robust ensemble-learning approach with stacked generalization for image segmentation. Initially, the input images are processed for feature extraction and edge detection using the Gabor filter and the Canny algorithms, respectively; our main goal is to determine the most feature descriptions. Subsequently, we applied the stacking generalization technique, which is generally built with two main learning levels. The first level is composed of two algorithms that give good results in the literature, namely: LightGBM (Light Gradient Boosting Machine) and SVM (support vector machine). The second level is the meta-model in which we use a predictor model that takes the base-level predictions to improve the accuracy of the final prediction. In the stacked generalization process, we use the Extreme Gradient Boosting (XGBoost); it takes as input the sub-models’ outputs to better classify each pixel of the image to give the final prediction. Today, several research works exist in the literature using different machine learning algorithms; in fact, instead of trying to find a single efficient and optimal learner, ensemble-based techniques take the advantage of each basic model; they integrate their outputs to obtain a more consistent and reliable learner. The result obtained from the models of individuals and our proposed approach is compared using a set of evaluation measures for image quality such as IoU, DSC, CC, SSIM, SAM, and UQI. The evaluation and a comparison of the results obtained showed more consistent predictions for the proposed model. Thus, we have made a comparison with some recent deep learning-based unsupervised segmentation methods. The evaluation and a comparison of the results obtained showed more coherent predictions for our stacked generalization in terms of precision, robustness, and consistency.
Tài liệu tham khảo
Khrissi L, El Akkad N, Satori H, Satori K (2022) Clustering method and sine cosine algorithm for image segmentation. Evol Intel 15(1):669–682
Khrissi L, Satori H, Satori K, el Akkad N (2021) An Efficient Image Clustering Technique based on Fuzzy C-means and Cuckoo Search Algorithm. Int J Adv Comput Sci Appl 12(6):423–432. https://doi.org/10.14569/IJACSA.2021.0120647.
Aliniya Z, Mirroshandel SA (2019) A novel combinatorial merge-split approach for automatic clustering using imperialist competitive algorithm. Expert Syst Appl 117:243–266
Javed A, Kim YC, Khoo MC, Ward SLD, Nayak KS (2015) Dynamic 3-D MR visualization and detection of upper airway obstruction during sleep using region-growing segmentation. IEEE Trans Biomed Eng 63(2):431–437
Chen X, Williams BM, Vallabhaneni SR, Czanner G, Williams R, Zheng Y (2019) Learning active contour models for medical image segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 11632–11640
Wang X, Wang S, Zhu Y, Meng X (2012) Image segmentation based on support vector machine. In: Proceedings of 2012 2nd International Conference on Computer Science and Network Technology. pp 202–206. https://doi.org/10.1109/ICCSNT.2012.6525921
Faska Z, Khrissi L, Haddouch K, El Akkad N (2021) A powerful and efficient method of image segmentation based on random forest algorithm. In: International Conference on Digital Technologies and Applications. Springer, Cham, pp 893–903
Khrissi L, El Akkad N, Satori H, Satori K (2020) Image segmentation based on k-means and genetic algorithms. In: Embedded systems and artificial intelligence. Springer, Singapore, pp 489–497
Moussaoui H, Benslimane M, El Akkad N (2022) Image segmentation approach based on hybridization between K-means and Mask R-CNN. In: WITS 2020. Springer, Singapore, pp 821–830
Gangwar S, Chauhan RP (2015) Survey of clustering techniques enhancing image segmentation process. In: International Conference on Advances in Computing and Communication Engineering. pp 34–39
Canny J (1986) A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell 8(6):679–698
Deriche R (1987) Using Canny’s criteria to derive a recursively implemented optimal edge detector. Int J Comput Vision 1(2):167–187
Karoui I, Fablet R, Boucher J, Augustin J (2010) Variational region-based segmentation using multiple texture statistics. IEEE Trans Image Process 19(12):3146–3156
Naz S, Majeed H, Irshad H (2010) Image segmentation using fuzzy clustering: a survey. In: International conference on emerging technologies. pp 181–186
Rambabu C, Chakrabarti I, Mahanta A (2004) Flooding-based watershed algorithm and its prototype hardware architecture. IEEE Proc Vision Image Signal Process 151(3):224–234
Jiang Y, Zhao K, Xia K, Xue J, Zhou L, Ding Y, Qian P (2019) A novel distributed multitask fuzzy clustering algorithm for automatic MR brain image segmentation. J Med Syst 43(5):118
Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proceedings of the 22Nd ACM SIGKDD international conference on knowledge discovery and data mining. pp 785–794. https://doi.org/10.1145/2939672.2939785
Yang XD, Wang JM, Zhang LN (2017) Application of XGBoost in ultra-short term load forecasting. Electr Drive Autom 39:21–25
Zhao T, Zheng S, Li W (2018) Research on credit risk analysis based on XGBoost. Softw Eng 21:33–35
Wang C, Deng C, Wang S (2019) Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost, arXiv. Available online: https://arxiv.org/abs/1908.01672. Accessed 24 Jan 2021
Zhu S, Zhu F (2019) Cycling comfort evaluation with instrumented probe bicycle. Transp Res Part A Policy Pract 129:217–231. https://doi.org/10.1016/j.tra.2019.08.009
Mo H, Sun H, Liu J, Wei S (2019) Developing window behavior models for residential buildings using XGBoost algorithm. Energy Build 205:109564. https://doi.org/10.1016/j.enbuild.2019.109564
Yue L, Yi Z, Pan J, Li X, Li J (2021) Identify M subdwarfs from M-type Spectra using XGBoost. Optik 225:165535. https://doi.org/10.1016/j.ijleo.2020.165535
Reinstein I (2017) XGBoost a top machine learning method on Kaggle, explained. Available online: http://www.kdnuggets.com/2017/10/xgboost-topmachine-learning-method-kaggle-explained.html. Accessed 23 Jan 2021
Tianqi C, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. pp 785–794
Wang L, Wu C, Tang L, Zhang W, Lacasse S, Liu H, Gao L (2020) Efficient reliability analysis of earth dam slope stability using Extreme Gradient Boosting method. Acta Geotech 15(11):3135e3150
Ke GL, Meng Q, Finley T, Wang TF, Chen W, Ma WD, Ye QW, Liu TY (2017) LightGBM: a highly efficient gradient boosting decision tree. In: Proceedings of the 31st Annual Conference on Neural Information Processing Systems, Long Beach, CA, USA. pp 3146–3154
Vapnik V, Cortes C (1995) Support-vector networks. Mach Learn 20:273e297
Vapnik VN (1998) Statistical learning theory. Wiley, New York
Zhang C, Chen X, Chen M, Chen S-C (2005) A multiple instance learning approach for content based image retrieval using one-class support vector machine. In: Proceedings of the IEEE International Conference on Multimedia and Expo. pp 1142–1145
Zhang L, Lin F, Zhang B (2001) Support vector machine learning for image retrieval. In: Proceedings of the IEEE International Conference on Image Processing. pp 721–724
Shao H, Jiang H, Lin Y, Li X (2018) A novel method for intelligent fault diagnosis of rolling bearings using ensemble deep auto-encoders. Mech Syst Signal Process 102:278–297
Re M, Valentini G (2012) Ensemble methods: a review. In: Advances in Machine Learning and Data Mining for Astronomy. London, United Kingdom: Chapman & Hall.
Dietterich TG (2000) Ensemble Methods in Machine Learning. In: Multiple Classifier Systems. MCS 2000. Lecture Notes in Computer Science. Springer, Berlin, Heidelberg, Vol. 1857 pp 1–15. https://doi.org/10.1007/3-540-45014-9_1.
Zhou J, Peng T, Zhang C, Sun N (2018) Data pre-analysis and ensemble of various artificial neural networks for monthly streamflow forecasting. Water 10:628
David B (2018) Online cross-validation-based ensemble learning. Stat Med 2:37
Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu TY (2017) LightGBM: a highly efficient gradient boosting decision tree. In: 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA
Hu CA, Chen CM, Fang YC, Liang SJ, Wang HC, Fang WF, Sheu CC, Perng WC, Yang KY, Kao KC, Wu CL, Tsai CS, Lin MY, Chao WC (2020) Using a machine learning approach to predict mortality in critically ill influenza patients: a cross-sectional retrospective multicentre study in Taiwan. BMJ Open 10(2):e033898
Website of the Kaggle. Available online: https://www.kaggle.com/. Accessed 15 Dec 2021
Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T-Y (2017) LightGBM: a highly efficient gradient boosting decision tree. In: Advances in neural information processing systems. pp 3146–3154. http://www.audentia-gestion.fr/MICROSOFT/lightgbm.pdf
Kodaz H, Özşen S, Arslan A, Güneş S (2009) Medical application of information gain based artificial immune recognition system (AIRS): diagnosis of thyroid disease. Expert Syst Appl 36:3086–3092
Mountrakis G, Im J, Ogole C (2011) Support vector machines in remote sensing: a review. ISPRS J Photogramm Remote Sens 66:247–259
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297
Wolpert DH (1992) Stacked generalization. Neural Netw 5:241e259. https://doi.org/10.1016/S0893-6080(05)80023-1
Ren Y, Zhang L, Suganthan PN (2016) Ensemble classification and regression-recent developments, applications and futuredirections. IEEE Comput Intell Mag 11(1):41–53
Mitchell TM, Keller RM, Kedar-Cabelli ST (1986) Explanationbased generalization: a unifying view. Mach Learn 1(1):47–80
Canny J (1989) A computational approach to edge detection. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-8(6). pp 679–698
Arbelaez P, Maire M, Fowlkes C, Malik J (2011) Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell 33:898–916
Kumar P (2019) Machine learning quick reference. Packt Publishing Ltd., Birmingham
Hsu C, Chang C, Lin C (2003) A practical guide to support vector classification. pp 1–16
Yilmaz V, Gungor O (2016) Determining the optimum image fusion method for better interpretation of the surface of the Earth. Nor Geogr Tidsskr 70(2):69–81
Alparone L, Wald L, Chanussot J, Member S, Thomas C, Gamba P, Bruce LM (2007) Comparison of pansharpening algorithms: outcome of the 2006 GRS-S data-fusion contest. IEEE Trans Geosci Remote Sens 45:3012–3021
Wang Z, Bovik AC, Sheikh HR, Member S, Simoncelli EP, Member S (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13:1–14
Alparone L, Aiazzi B, Baronti S, Garzelli A, Nencini F, Selva M (2008) Multispectral and panchromatic data fusion assessment without reference. Photogramm Eng Remote Sens 74:193–200. https://doi.org/10.14358/PERS.74.2.193
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905
Yang AY, Wright J, Ma Y, Sastry SS (2008) Unsupervised segmentation of natural images via lossy data compression. Comput Vis Image Underst 110(2):212–225
Deng Y, Manjunath B (2001) Unsupervised segmentation of color-texture regions in images and video. IEEE Trans Pattern Anal Mach Intell 23(8):800–810
Li Z, Wu X-M, Chang S-F Chang (2012) Segmentation using superpixels: A bipartite graph partitioning approach, in Proc. of IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE 789–796.
Kanezaki A (2018) Unsupervised Image Segmentation by Backpropagation. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada 1543-1547. https://doi.org/10.1109/ICASSP.2018.8462533.
Xia X, Kulis B (2017) W-Net: a deep model for fully unsupervised image segmentation, arXiv preprint arXiv:1711.08506
Khan Z, Yang J (2020) Bottom-up unsupervised image segmentation using FC-Dense U-Net based deep representation clustering and multidimensional feature fusion-based region merging. Image Vis Comput 94:1–11. https://doi.org/10.1016/j.imavis.2020.103871Elsevier
Zhang Y, Zhang H, Guo Y, Lin K, He J (2019) An adaptive affinity graph with subspace pursuit for natural image segmentation. In: 2019 IEEE International Conference on Multimedia and Expo (ICME), Shanghai, China. pp 802–807
Donoser M, Schmalstieg D (2014) Discrete-continuous gradient orientation estimation for faster image segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3158–3165
Tan KS, Ashidi MIN (2001) Color image segmentation using histogram thresholding—fuzzy C-means hybrid approach. Pattern Recognit 44:1–15
Zhang YX, Bai XZ, Fan RR, Wang ZH (2019) Deviation-sparse fuzzy C-means with neighbor information constraint. IEEE Trans Fuzzy Syst 27(1):185–199
Cour T et al (2005) Spectral segmentation with multiscale graph decomposition. IEEE Conf Comput Vision Pattern Recognit 2:1124–1131
Arbelaez P (2006) Boundary extraction in natural images using ultra-metric contour maps. In: IEEE Conference on Computer Vision and Pattern Recognition Workshop. pp 182–182
Vedaldi A, Soatto S (2008) Quick shift and kernel methods for mode seeking. In: European Conference on Computer Vision. pp 705–718