Enhancing flood prediction in Southern West Bengal, India using ensemble machine learning models optimized with symbiotic organisms search algorithm

Gilbert Hinge1, Swati Sirsant2, Amandeep Kumar1, Ruchika Gupta1, Mohamed A. Hamouda3,4
1Department of Civil Engineering, National Institute of Technology Durgapur, Durgapur, India
2Department of Civil Engineering, Nirma University, Ahmedabad, India
3National Water and Energy Centre, United Arab Emirates University, Al Ain, United Arab Emirates
4Department of Civil and Environmental Engineering, United Arab Emirates University, Al Ain, United Arab Emirates

Tóm tắt

In regions with limited flow and catchment data needed for the configuration and calibration of hydraulic and hydrological models, employing spatial flood modeling and mapping enables authorities to predict the spatial extent and severity of floods. This study leveraged flood inventory data, coupled with various conditional variables, to formulate a novel Ensemble model. This ensemble model combined four hybridized models based on Support Vector Machine (SVM), Naïve Bayes (NB), Decision Classification Tree (DCT), and Artificial Neural Network (ANN), all of which were optimized using the metaheuristic Symbiotic Organisms Search algorithm (SOS). The precision of the flood inundation map generated by the four hybrid models and the ensemble model was assessed using standard metrics. The results demonstrated that the ensemble model outperformed other models, with an accuracy metric of 0.99 Area Under the Curve (AUC) during the training stage and 0.96 during the testing stage. This underscores the effectiveness of the ensemble approach in flood preparedness and response applications. Furthermore, a comparison was conducted, comparing the performance of the developed ensemble model against other studies within the state of West Bengal. The findings highlighted a significant improvement in the ensemble model's performance with an AUC score of 0.96 in validation compared to studies in similar areas within West Bengal with AUC score ranged from 0.73 to 0.92. In conclusion, the methodology employed in this study holds promise for application in other regions worldwide that face challenges related to limited data availability for accurate flood inundation mapping.

Từ khóa


Tài liệu tham khảo

Abedi R, Costache R, Shafizadeh-Moghadam H, Pham QB (2021) Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees. Geocarto Int 37:5479–5496. https://doi.org/10.1080/10106049.2021.1920636 Arabameri A, Saha S, Mukherjee K et al (2020) Modeling spatial flood using novel ensemble artificial intelligence approaches in northern Iran. Remote Sens 12:3423 Aydin HE, Iban MC (2023) Predicting and analyzing flood susceptibility using boosting-based ensemble machine learning algorithms with SHapley Additive exPlanations. Nat Hazards 116:2957–2991 Bandyopadhyay S, Kar NS, Das S, Sen J (2014) River systems and water resources of West Bengal: a review. Geol Soc India Spec Publ 3(2014):63–84 Baruah A, Kumar Sarma A, Hinge G (2021) Hydrological-hydrodynamic nexus for evaluation of fish habitat suitability in the Bhogdoi River, India. J Hydrol Eng 26:4021032 Bentivoglio R, Isufi E, Jonkman SN, Taormina R (2021) Deep learning methods for flood mapping: a review of existing applications and future research directions. Hydrol Earth Syst Sci 26:4345–4378. https://doi.org/10.5194/hess-26-4345-2022 Bermúdez M, Cea L, Puertas J (2019) A rapid flood inundation model for hazard mapping based on least squares support vector machine regression. J Flood Risk Manag 12:e12522 Berrar D (2018) Bayes’ theorem and naive Bayes classifier. Encycl Bioinforma Comput Biol ABC Bioinforma 403:412 Bui DT, Hoang N-D, Martínez-Álvarez F et al (2020) A novel deep learning neural network approach for predicting flash flood susceptibility: a case study at a high frequency tropical storm area. Sci Total Environ 701:134413 Bui DT, Ngo P-TT, Pham TD et al (2019) A novel hybrid approach based on a swarm intelligence optimized extreme learning machine for flash flood susceptibility mapping. Catena 179:184–196 Cheng M-Y, Prayogo D (2014) Symbiotic organisms search: a new metaheuristic optimization algorithm. Comput Struct 139:98–112 Corne DW, Reynolds A, Bonabeau E (2012) Swarm Intelligence. In: Rozenberg G, Bäck T, Kok JN (eds) Handbook of Natural Computing. Springer, Berlin, Heidelberg, pp 1599–1622 Costache R (2019) Flash-flood potential index mapping using weights of evidence, decision trees models and their novel hybrid integration. Stoch Environ Res Risk Assess 33:1375–1402 Costache R, Ngo PTT, Bui DT (2020a) Novel ensembles of deep learning neural network and statistical learning for flash-flood susceptibility mapping. Water 12:1549 Costache R, Pham QB, Sharifi E et al (2019) Flash-flood susceptibility assessment using multi-criteria decision making and machine learning supported by remote sensing and GIS techniques. Remote Sens 12:106 Costache R, Țîncu R, Elkhrachy I et al (2020b) New neural fuzzy-based machine learning ensemble for enhancing the prediction accuracy of flood susceptibility mapping. Hydrol Sci J 65:2816–2837 Darabi H, Haghighi AT, Rahmati O et al (2021) A hybridized model based on neural network and swarm intelligence-grey wolf algorithm for spatial prediction of urban flood-inundation. J Hydrol 603:126854 Deroliya P, Ghosh M, Mohanty MP et al (2022) A novel flood risk mapping approach with machine learning considering geomorphic and socio-economic vulnerability dimensions. Sci Total Environ 851:158002 Gharehchopogh FS, Shayanfar H, Gholizadeh H (2020) A comprehensive survey on symbiotic organisms search algorithms. Artif Intell Rev 53:2265–2312 Ghosh A, Dey P, Ghosh T (2022a) Integration of RS-GIS with frequency ratio, fuzzy logic, logistic regression and decision tree models for flood susceptibility prediction in lower gangetic plain: a study on malda district of West Bengal, India. J Indian Soc Remote Sens 50:1725–1745 Ghosh S, Saha S, Bera B (2022b) Flood susceptibility zonation using advanced ensemble machine learning models within Himalayan foreland basin. Nat Hazards Res 2:363–374 Gunathilake MB, Amaratunga YV, Perera A et al (2020) Evaluation of future climate and potential impact on streamflow in the Upper Nan River Basin of Northern Thailand. Adv Meteorol 2020:1–15 Gunathilake MB, Senerath T, Rathnayake U (2021a) Artificial neural network based PERSIANN data sets in evaluation of hydrologic utility of precipitation estimations in a tropical watershed of Sri Lanka. AIMS Geosci 7:478–489 Gunathilake MB, Zamri MNM, Alagiyawanna TP et al (2021b) Hydrologic utility of satellite-based and gauge-based gridded precipitation products in the Huai Bang Sai Watershed of Northeastern Thailand. Hydrology 8:165 Ha H, Luu C, Bui QD et al (2021) Flash flood susceptibility prediction mapping for a road network using hybrid machine learning models. Nat Hazards 109:1247–1270 Hinge G, Hamouda MA, Mohamed MM (2024) Flash flood susceptibility modelling using soft computing-based approaches: from bibliometric to meta-data analysis and future research directions. Water 16:173 Ilia I, Tsangaratos P, Tzampoglou P et al (2022) Flash flood susceptibility mapping using stacking ensemble machine learning models. Geocarto Int 37:15010–15036. https://doi.org/10.1080/10106049.2022.2093990 Islam ARMT, Talukdar S, Mahato S et al (2021) Flood susceptibility modelling using advanced ensemble machine learning models. Geosci Front 12:101075 Jonkman SN (2005) Global perspectives on loss of human life caused by floods. Nat Hazards 34:151–175 Karunanayake C, Gunathilake MB, Rathnayake U (2020) Inflow forecast of iranamadu reservoir, Sri Lanka, under projected climate scenarios using artificial neural networks. Appl Comput Intell Soft Comput 2020:1–11 Khattak MS, Anwar F, Saeed TU et al (2016) Floodplain mapping using HEC-RAS and ArcGIS: a case study of Kabul River. Arab J Sci Eng 41:1375–1390 Kundapura S, Soman A, Kuruvilla E (2022) Machine learning ensemble model for flood susceptibility mapping. In: 2022 IEEE 7th International Conference on Recent Advances and Innovations in Engineering (ICRAIE). MANGALORE, India, pp 126–131. https://doi.org/10.1109/ICRAIE56454.2022.10054327 Liu J, Wang J, Xiong J et al (2021) Hybrid models incorporating bivariate statistics and machine learning methods for flash flood susceptibility assessment based on remote sensing datasets. Remote Sens 13:4945 Liu R, Chen Y, Wu J et al (2016) Assessing spatial likelihood of flooding hazard using naïve Bayes and GIS: a case study in Bowen Basin, Australia. Stoch Environ Res Risk Assess 30:1575–1590 Massam A, Smith H, Filipova V et al (2023) Flood risk in Kuala Lumpur, Malaysia: a consideration of flood defences in a broadscale hydraulic model. J Flood Risk Manag 16:e12907. https://doi.org/10.1111/jfr3.12907 Mitra R, Saha P, Das J (2022) Assessment of the performance of GIS-based Analytical Hierarchical Process (AHP) approach for flood modelling in Uttar Dinajpur district of West Bengal, India. Geomatics, Nat Hazards Risk 13:2183–2226 Ngo P-TT, Hoang N-D, Pradhan B et al (2018) A novel hybrid swarm optimized multilayer neural network for spatial prediction of flash floods in tropical areas using sentinel-1 SAR imagery and geospatial data. Sensors 18:3704 Ngo P-TT, Pham TD, Nhu V-H et al (2021) A novel hybrid quantum-PSO and credal decision tree ensemble for tropical cyclone induced flash flood susceptibility mapping with geospatial data. J Hydrol 596:125682 Nhu V-H, Thi Ngo P-T, Pham TD et al (2020) A new hybrid firefly–PSO optimized random subspace tree intelligence for torrential rainfall-induced flash flood susceptible mapping. Remote Sens 12:2688 Noble WS (2006) What is a support vector machine? Nat Biotechnol 24:1565–1567 Özdemir H, Baduna Koçyiğit M, Akay D (2023) Flood susceptibility mapping with ensemble machine learning: a case of Eastern Mediterranean basin, Türkiye. Stoch Environ Res Risk Assess 37:4273–4290. https://doi.org/10.1007/s00477-023-02507-z Pal S, Mazumdar D, Chakraborty PK (2015) District-wise trend analysis of rainfall pattern in last century (1901–2000) over Gangetic region in West Bengal, India. J Appl Nat Sci 7:750–757 Pal SC, Chowdhuri I, Das B et al (2022) Threats of climate change and land use patterns enhance the susceptibility of future floods in India. J Environ Manage 305:114317 Pham BT, Jaafari A, Van Phong T et al (2021) Improved flood susceptibility mapping using a best first decision tree integrated with ensemble learning techniques. Geosci Front 12:101105 Plataridis K, Mallios Z (2023) Flood susceptibility mapping using hybrid models optimized with Artificial Bee Colony. J Hydrol 624:129961 Popa MC, Peptenatu D, Drăghici CC, Diaconu DC (2019) Flood hazard mapping using the flood and flashflood potential index in the Buzău river catchment, Romania. Water 11:2116 Prasad P, Loveson VJ, Das B, Kotha M (2022) Novel ensemble machine learning models in flood susceptibility mapping. Geocarto Int 37:4571–4593. https://doi.org/10.1080/10106049.2021.1892209 Rahmati O, Darabi H, Panahi M et al (2020) Development of novel hybridized models for urban flood susceptibility mapping. Sci Rep 10:12937 Rudra K (2018) Rivers of the Ganga-Brahmaputra-Meghna Delta. Springer International Publishing, Cham. https://doi.org/10.1007/978-3-319-76544-0 Ruidas D, Chakrabortty R, Islam ARM et al (2022) A novel hybrid of meta-optimization approach for flash flood-susceptibility assessment in a monsoon-dominated watershed, Eastern India. Environ Earth Sci 81:1–22 Ruidas D, Saha A, Islam ARMT et al (2023) Development of geo-environmental factors controlled flash flood hazard map for emergency relief operation in complex hydro-geomorphic environment of tropical river, India. Environ Sci Pollut Res 30:106951–106966 Sadiq R, Akhtar Z, Imran M, Ofli F (2022) Integrating remote sensing and social sensing for flood mapping. Remote Sens Appl Soc Environ 25:100697 Saha A, Pal SC, Arabameri A et al (2021) Flood susceptibility assessment using novel ensemble of hyperpipes and support vector regression algorithms. Water 13:241 Satya Priya L, Young W, Hopson TM, Avasthi A (2017) Flood risk assessment and forecasting for the Ganges-Brahmaputra-Meghna River basins (English). Washington, D.C. : World Bank Group. https://documents.worldbank.org/curated/en/272611508255892547/Flood-risk-assessment-and-forecasting-for-the-Ganges-Brahmaputra-Meghna-River-basins Seydi ST, Kanani-Sadat Y, Hasanlou M et al (2022) Comparison of machine learning algorithms for flood susceptibility mapping. Remote Sens 15:192 Shahabi H, Shirzadi A, Ronoud S et al (2021) Flash flood susceptibility mapping using a novel deep learning model based on deep belief network, back propagation and genetic algorithm. Geosci Front 12:101100 Shirzadi A, Asadi S, Shahabi H et al (2020) A novel ensemble learning based on Bayesian Belief Network coupled with an extreme learning machine for flash flood susceptibility mapping. Eng Appl Artif Intell 96:103971 Singha C, Swain KC, Meliho M et al (2022) Spatial analysis of flood hazard zoning map using novel hybrid machine learning technique in Assam, India. Remote Sens 14:6229 Tang X, Li J, Liu M et al (2020) Flood susceptibility assessment based on a novel random Naïve Bayes method: A comparison between different factor discretization methods. Catena 190:104536 Tehrany MS, Pradhan B, Jebur MN (2014) Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS. J Hydrol 512:332–343 Tehrany MS, Pradhan B, Jebur MN (2013) Spatial prediction of flood susceptible areas using rule based decision tree (DT) and a novel ensemble bivariate and multivariate statistical models in GIS. J Hydrol 504:69–79 Tino P, Benuskova L, Sperduti A (2015) Artificial neural network models. In: Kacprzyk J, Pedrycz W (eds) Springer Handbook of Computational Intelligence. Springer, Berlin, Heidelberg, pp 455–471 Vongkusolkit J, Peng B, Wu M et al (2023) Near real-time flood mapping with weakly supervised machine learning. Remote Sens 15:3263 Yaseen A, Lu J, Chen X (2022) Flood susceptibility mapping in an arid region of Pakistan through ensemble machine learning model. Stoch Environ Res Risk Assess 36:3041–3061. https://doi.org/10.1007/s00477-022-02179-1 Yilmaz CS, Yilmaz V, Gungor O, Shan J (2019) Metaheuristic pansharpening based on symbiotic organisms search optimization. ISPRS J Photogramm Remote Sens 158:167–187 Zhang K, Shalehy MH, Ezaz GT et al (2022) An integrated flood risk assessment approach based on coupled hydrological-hydraulic modeling and bottom-up hazard vulnerability analysis. Environ Model Softw 148:105279