Geospatial modeling using hybrid machine learning approach for flood susceptibility

Springer Science and Business Media LLC - Tập 15 - Trang 2619-2636 - 2022
Bibhu Prasad Mishra1, Dillip Kumar Ghose2, Deba Prakash Satapathy1
1Department of Civil Engineering, Odisha University of Technology and Research, Bhubaneswar, India
2Department of Civil Engineering, National Institute of Technology, Silchar, India

Tóm tắt

Advanced methods for flood susceptibility mapping are required to minimize hazards in the watershed. Here, Partial Least Square-Structural Equation Model (PLS-SEM) was introduced to analyze the impact of flood influencing factors. PLS-SEM integrated with four Machine Learning (ML) methods as Multi-Layer Perceptron Neural Network (MLPNN), K Nearest Neighbor (KNN), Support Vector Machine (SVM) and Radial Basis Function Neural network (RBFN). In addition, significant flood influencing factors from PLS-SEM analysis was taken as the input of ML models. Then SVM, MLPNN, KNN, and RBFN integrated with the PLS-SEM classifier to develop hybrid models for constructing FSM. The performance of models is assessed in terms of standard statistical methods. The performance of the achieved model is good having AUROC > 0.8 and PLS-SEM-SVM (AUROC = 0.978) perform superior than others. Thus, hybrid SVM model can be best utilized for flood susceptibility. This study provides the importance of mechanism for flood influencing factors and extends the application of proposed hybrid ML models to minimize flood risk.

Tài liệu tham khảo

Alexander M, Viavattene C, Faulkner H, Priest S (2011) A GIS-based flood risk assessment tool: supporting flood incident management at the local scale. Flood Hazard Research Centre, Middlesex University, London Apel H, Aronica GT, Kreibich H, Thieken AH (2009) Flood risk analyses—how detailed do we need to be? Nat Hazards 49(1):79–98 Bates PD (2004) Remote sensing and flood inundation modelling. Hydrol Process 18(13):2593–2597 Bentler PM, Huang W (2014) On components, latent variables, PLS and simple methods: reactions to Rigdon’s rethinking of PLS. Long Range Plan 47(3):138–145 Biswajeet P, Mardiana S (2009) Flood hazrad assessment for cloud prone rainy areas in a typical tropical environment. Disaster Advances 2(2):7–15 Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp 144–152 Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159 Chau KW, Wu CL, Li YS (2005) Comparison of several flood forecasting models in Yangtze River. J Hydrol Eng 10(6):485–491 Chen Y-R, Yeh C-H, Yu B (2011) Integrated application of the analytic hierarchy process and the geographic information system for flood risk assessment and flood plain management in Taiwan. Nat Hazards 59(3):1261–1276 Choubin B, Moradi E, Golshan M, Adamowski J, Sajedi-Hosseini F, Mosavi A (2019) An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. Sci Total Environ 651:2087–2096 Csáfordi P, Pődör A, Bug J, Gribovszki Z (2012) Soil erosion analysis in a small forested catchment supported by ArcGIS Model Builder. Acta Silv Lignaria Hung 8:39–55 Dankers R, Arnell NW, Clark DB, Falloon PD, Fekete BM, Gosling SN, Heinke J, Kim H, Masaki Y, Satoh Y (2014) First look at changes in flood hazard in the inter-sectoral impact model intercomparison project ensemble. Proc Natl Acad Sci 111(9):3257–3261 Dano UL, Balogun A-L, Matori A-N, Wan Yusouf K, Abubakar IR, Said Mohamed MA, Aina YA, Pradhan B (2019) Flood susceptibility mapping using GIS-based analytic network process: a case study of Perlis, Malaysia. Water 11(3):615 Dijkstra TK, Henseler J (2015) Consistent partial least squares path modeling. MIS Q 39(2):297–316 Elshorbagy A, Corzo G, Srinivasulu S, Solomatine DP (2010a) Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology-part 1: concepts and methodology. Hydrol Earth Syst Sci 14(10):1931–1941 Elshorbagy A, Corzo G, Srinivasulu S, Solomatine DP (2010b) Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology-part 2: application. Hydrol Earth Syst Sci 14(10):1943–1961 Fawcett T (2006) Introduction to receiver operator curves. Pattern Recognit Lett 27:861–874 Fenicia F, Kavetski D, Savenije HHG, Clark MP, Schoups G, Pfister L, Freer J (2014) Catchment properties, function, and conceptual model representation: is there a correspondence? Hydrol Process 28(4):2451–2467 Furl C, Ghebreyesus D, Sharif HO (2018) Assessment of the performance of satellite-based precipitation products for flood events across diverse spatial scales using GSSHA modeling system. Geosciences 8(6):191 Gerlach RW, Kowalski BR, Wold HOA (1979) Partial least-squares path modelling with latent variables. Anal Chim Acta 112(4):417–421 Ghose DK, Panda SS, Swain PC (2010) Prediction of water table depth in western region, Orissa using BPNN and RBFN neural networks. J Hydrol 394(3):296–304. https://doi.org/10.1016/j.jhydrol.2010.09.003 Gokceoglu C, Sonmez H, Nefeslioglu HA, Duman TY, Can T (2005) The 17 March 2005 Kuzulu landslide (Sivas, Turkey) and landslide-susceptibility map of its near vicinity. Eng Geol 81(1):65–83 Hair JF, Hult GTM, Ringle CM, Sarstedt M, Thiele KO (2017) Mirror, mirror on the wall: a comparative evaluation of composite-based structural equation modeling methods. J Acad Mark Sci 45(5):616–632 Haq M, Akhtar M, Muhammad S, Paras S, Rahmatullah J (2012) Techniques of remote sensing and GIS for flood monitoring and damage assessment: a case study of Sindh province, Pakistan. Egypt J Remote Sens Space Sci 15(2):135–141 Heba FE, Darwish A, Hassanien AE, Abraham A (2010) Principle components analysis and support vector machine-based intrusion detection system. 2010 10th international conference on intelligent systems design and applications, pp 363–367 Henseler J, Dijkstra TK, Sarstedt M, Ringle CM, Diamantopoulos A, Straub DW, Ketchen DJ Jr, Hair JF, Hult GTM, Calantone RJ (2014) Common beliefs and reality about PLS: comments on Rönkkö and Evermann (2013). Organ Res Methods 17(2):182–209 Hong H, Panahi M, Shirzadi A, Ma T, Liu J, Zhu A-X, Chen W, Kougias I, Kazakis N (2018) Flood susceptibility assessment in Hengfeng area coupling adaptive neuro-fuzzy inference system with genetic algorithm and differential evolution. Sci Total Environ 621:1124–1141 Horng S-J, Su M-Y, Chen Y-H, Kao T-W, Chen R-J, Lai J-L, Perkasa CD (2011) A novel intrusion detection system based on hierarchical clustering and support vector machines. Expert Syst Appl 38(1):306–313 Horritt MS (2006) A methodology for the validation of uncertain flood inundation models. J Hydrol 326(1–4):153–165 Hosseini FS, Choubin B, Mosavi A, Nabipour N, Shamshirband S, Darabi H, Haghighi AT (2020) Flash-flood hazard assessment using ensembles and Bayesian-based machine learning models: Application of the simulated annealing feature selection method. Sci Total Environ 711:135161 Huong HTL, Pathirana A (2013) Urbanization and climate change impacts on future urban flooding in Can Tho city, Vietnam. Hydrol Earth Syst Sci 17(1):379–394 Hussain M, Wajid SK, Elzaart A, Berbar M (2011) A comparison of SVM kernel functions for breast cancer detection. 2011 eighth international conference computer graphics, imaging and visualization, pp 145–150 Idrissi TEL, Idri A, Bakkoury Z (2019) Systematic map and review of predictive techniques in diabetes self-management. Int J Inf Manage 46:263–277 Jain AK, Mao J, Mohiuddin KM (1996) Artificial neural networks: a tutorial. Computer 29(3):31–44 Kazakis N, Kougias I, Patsialis T (2015) Assessment of flood hazard areas at a regional scale using an index-based approach and analytical hierarchy process: application in Rhodope-Evros region, Greece. Sci Total Environ 538:555–563 Khosravi K, Pham BT, Chapi K, Shirzadi A, Shahabi H, Revhaug I, Prakash I, Bui DT (2018) A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci Total Environ 627:744–755 Khu ST, Liong S, Babovic V, Madsen H, Muttil N (2001) Genetic programming and its application in real-time runoff forecasting 1. J Am Water Resour Assoc 37(2):439–451 Kia MB, Pirasteh S, Pradhan B, Mahmud AR, Sulaiman WNA, Moradi A (2012) An artificial neural network model for flood simulation using GIS: Johor River Basin, Malaysia. Environ Earth Sci 67(1):251–264 Lawal DU, Matori AN, Hashim AM, Wan Yusof K, Chandio IA (2012) Detecting flood susceptible areas using GIS-based analytic hierarchy process Lawal DU, Matori A-N, Yusuf KW, Hashim AM, Balogun A-L (2014) Analysis of the flood extent extraction model and the natural flood influencing factors: a GIS-based and remote sensing analysis. IOP Conf Ser: Earth Environ Sci 18(1):012059 Lecca G, Petitdidier M, Hluchy L, Ivanovic M, Kussul N, Ray N, Thieron V (2011) Grid computing technology for hydrological applications. J Hydrol 403(1–2):186–199 Lee S, Kim J-C, Jung H-S, Lee MJ, Lee S (2017) Spatial prediction of flood susceptibility using random-forest and boosted-tree models in Seoul metropolitan city, Korea. Geomat Nat Haz Risk 8(2):1185–1203 Leong L-Y, Hew T-S, Ooi K-B, Wei J (2020) Predicting mobile wallet resistance: a two-staged structural equation modeling-artificial neural network approach. Int J Inf Manage 51:102047 Liang D, Xu Z, Liu D (2017) Three-way decisions with intuitionistic fuzzy decision-theoretic rough sets based on point operators. Inf Sci 375:183–201 Lohmöller J-B (1989) Predictive vs. structural modeling: Pls vs. ml. In: Latent variable path modeling with partial least squares. Springer, pp 199–226 Luu C, von Meding J, Kanjanabootra S (2018) Assessing flood hazard using flood marks and analytic hierarchy process approach: a case study for the 2013 flood event in Quang Nam, Vietnam. Nat Hazards 90(3):1031–1050 Luu C, Pham BT, van Phong T, Costache R, Nguyen HD, Amiri M, Bui QD, Nguyen LT, van Le H, Prakash I (2021) GIS-based ensemble computational models for flood susceptibility prediction in the Quang Binh Province, Vietnam. J Hydrol 599:126500 Mahmoud SH, Gan TY (2018) Urbanization and climate change implications in flood risk management: developing an efficient decision support system for flood susceptibility mapping. Sci Total Environ 636:152–167 Manandhar B (2010) Flood plain analysis and risk assessment of Lothar Khola. Master of Science Thesis in Watershed Management. Tribhuvan University Institute of Forestry Pokhara, Nepal Meyer V, Scheuer S, Haase D (2009) A multicriteria approach for flood risk mapping exemplified at the Mulde river, Germany. Nat Hazards 48(1):17–39 Miller JD, Hutchins M (2017) The impacts of urbanisation and climate change on urban flooding and urban water quality: A review of the evidence concerning the United Kingdom. J Hydrol: Reg Stud 12:345–362 Mojaddadi H, Pradhan B, Nampak H, Ahmad N, bin Ghazali AH (2017) Ensemble machine-learning-based geospatial approach for flood risk assessment using multi-sensor remote-sensing data and GIS. Geomat Nat Haz Risk 8(2):1080–1102 Moore ID, Wilson JP (1992) Length-slope factors for the revised universal soil loss equation: simplified method of estimation. J Soil Water Conserv 47(5):423–428 Oh H-J, Pradhan B (2011) Application of a neuro-fuzzy model to landslide-susceptibility mapping for shallow landslides in a tropical hilly area. Comput Geosci 37(9):1264–1276 Pamučar D, Stević Ž, Sremac S (2018) A new model for determining weight coefficients of criteria in mcdm models: full consistency method (fucom). Symmetry 10(9):393 Popovic M, Kuzmanović M, Savić G (2018) A comparative empirical study of analytic hierarchy process and conjoint analysis: literature review. Decis Mak Appl Manag Eng 1(2):153–163 Pradhan B, Hagemann U, Tehrany MS, Prechtel N (2014) An easy to use ArcMap based texture analysis program for extraction of flooded areas from TerraSAR-X satellite image. Comput Geosci 63:34–43 Radmehr A, Araghinejad S (2014) Developing strategies for urban flood management of Tehran city using SMCDM and ANN. J Comput Civ Eng 28(6):05014006 Rahmati O, Zeinivand H, Besharat M (2016) Flood hazard zoning in Yasooj region, Iran, using GIS and multi-criteria decision analysis. Geomat Nat Haz Risk 7(3):1000–1017 Rizeei HM, Pradhan B, Saharkhiz MA, Lee S (2019) Groundwater aquifer potential modeling using an ensemble multi-adoptive boosting logistic regression technique. J Hydrol 579:124172 Rosenblatt F (1961) Principles of neurodynamics. perceptrons and the theory of brain mechanisms. Cornell Aeronautical Lab Inc Buffalo NY Saini SS, Kaushik SP (2012) Risk and vulnerability assessment of flood hazard in part of Ghaggar Basin: a case study of Guhla block, Kaithal, Haryana, India. Int J Geomat Geosci 3(1):42–54 Samanta RK, Bhunia GS, Shit PK, Pourghasemi HR (2018) Flood susceptibility mapping using geospatial frequency ratio technique: a case study of Subarnarekha River Basin, India. Model Earth Syst Environ 4(1):395–408 Santos PP, Reis E, Pereira S, Santos M (2019) A flood susceptibility model at the national scale based on multicriteria analysis. Sci Total Environ 667:325–337 Siahkamari S, Haghizadeh A, Zeinivand H, Tahmasebipour N, Rahmati O (2018) Spatial prediction of flood-susceptible areas using frequency ratio and maximum entropy models. Geocarto Int 33(9):927–941 Siddayao GP, Valdez SE, Fernandez PL (2014) Analytic hierarchy process (AHP) in spatial modeling for floodplain risk assessment. Int J Mach Learn Comput 4(5):450 Sinha R, Bapalu GV, Singh LK, Rath B (2008) Flood risk analysis in the Kosi river basin, north Bihar using multi-parametric approach of analytical hierarchy process (AHP). J Indian Soc Remote Sens 36(4):335–349 Svozil D, Kvasnicka V, Pospichal J (1997) Introduction to multi-layer feed-forward neural networks. Chemom Intell Lab Syst 39(1):43–62 Taneja A, Arora A (2019) Modeling user preferences using neural networks and tensor factorization model. Int J Inf Manage 45:132–148 Tang Z, Zhang H, Yi S, Xiao Y (2018) Assessment of flood susceptible areas using spatially explicit, probabilistic multi-criteria decision analysis. J Hydrol 558:144–158 Tehrany MS, Pradhan B, Jebur MN (2013) Spatial prediction of flood susceptible areas using rule based decision tree (DT) and a novel ensemble bivariate and multivariate statistical models in GIS. J Hydrol 504:69–79 Tehrany MS, Pradhan B, Jebur MN (2014) Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS. J Hydrol 512:332–343 Teo A-C, Tan GW-H, Ooi K-B, Hew T-S, Yew K-T (2015) The effects of convenience and speed in m-payment. Ind Manag Data Syst 115:311–331 Termeh SVR, Kornejady A, Pourghasemi HR, Keesstra S (2018) Flood susceptibility mapping using novel ensembles of adaptive neuro fuzzy inference system and metaheuristic algorithms. Sci Total Environ 615:438–451 Toth E, Brath A, Montanari A (2000) Comparison of short-term rainfall prediction models for real-time flood forecasting. J Hydrol 239(1–4):132–147 Truong XL, Mitamura M, Kono Y, Raghavan V, Yonezawa G, Truong XQ, Do TH, Tien Bui D, Lee S (2018) Enhancing prediction performance of landslide susceptibility model using hybrid machine learning approach of bagging ensemble and logistic model tree. Appl Sci 8(7):1046 Vapnik VN (1995) The nature of statistical learning theory. Springer, Berlin Varoonchotikul P (2003) Flood forecasting using artificial neural networks. CRC Press, Boca Raton Vojinovic Z, Abbott MB (2012) Flood risk and social justice. IWA Publishing Wanders N, Bierkens MFP, de Jong SM, de Roo A, Karssenberg D (2014) The benefits of using remotely sensed soil moisture in parameter identification of large-scale hydrological models. Water Resour Res 50(8):6874–6891 Wang Y, Fang Z, Hong H, Peng L (2020) Flood susceptibility mapping using convolutional neural network frameworks. J Hydrol 582:124482 Xia X, Liang Q, Ming X, Hou J (2017) An efficient and stable hydrodynamic model with novel source term discretization schemes for overland flow and flood simulations. Water Resour Res 53(5):3730–3759 Yahaya S, Ahmad N, Abdalla RF (2010) Multicriteria analysis for flood vulnerable areas in Hadejia-Jama’are River basin, Nigeria. Eur J Sci Res 42(1):71–83 Yousefi S, Mirzaee S, Keesstra S, Surian N, Pourghasemi HR, Zakizadeh HR, Tabibian S (2018) Effects of an extreme flood on river morphology (case study: Karoon River, Iran). Geomorphology 304:30–39 Zazo S, Rodríguez-Gonzálvez P, Molina J-L, González-Aguilera D, Agudelo-Ruiz CA, Hernández-López D (2018) Flood hazard assessment supported by reduced cost aerial precision photogrammetry. Remote Sens 10(10):1566 Zenggang X, Zhiwen T, Xiaowen C, Xue-min Z, Kaibin Z, Conghuan Y (2021) Research on image retrieval algorithm based on combination of color and shape features. J Signal Process Syst 93(2):139–146 Zhang Q, Xu C-Y, Zhang Z, Chen YD, Liu C, Lin H (2008) Spatial and temporal variability of precipitation maxima during 1960–2005 in the Yangtze River basin and possible association with large-scale circulation. J Hydrol 353(3–4):215–227 Zhao G, Pang B, Xu Z, Yue J, Tu T (2018) Mapping flood susceptibility in mountainous areas on a national scale in China. Sci Total Environ 615:1133–1142 Zhu K (2007) Operator theory in function spaces (Issue 138). American Mathematical Soc