Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size

CATENA - Tập 145 - Trang 164-179 - 2016
Paraskevas Tsangaratos1, Ioanna Ilia2
1Mining and Metallurgical Engineering, National Technical University of Athens, School of Mining and Metallurgical Engineering, Department of Geological Studies, Zografou Campus: Heroon Polytechniou 9, 15780 Zografou, Greece
2National Technical University of Athens, School of Mining and Metallurgical Engineering, Department of Geological Studies, Zografou Campus: Heroon Polytechniou 9, 15780 Zografou, Greece

Tóm tắt

Từ khóa


Tài liệu tham khảo

Aguado, 2012, 576

Akgun, 2012, A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: a case study at İzmir, Turkey, Landslides, 9, 93, 10.1007/s10346-011-0283-7

Akgun, 2012, Application of remote sensing data and GIS for landslide risk assessment as an environmental threat to Izmir city (west Turkey), Environ. Monit. Assess., 184, 5453, 10.1007/s10661-011-2352-8

Aleotti, 1999, Landslide hazard assessment: summary review and new perspectives, Bull. Eng. Geol. Environ., 58, 21, 10.1007/s100640050066

Alexander, 1992, On the causes of landslides: human activities, perception, and natural processes, Environ. Geol. Water Sci., 20, 165, 10.1007/BF01706160

Alimohammadlou, 2013, Landslide process and impacts: a proposed classification method, Catena, 104, 219, 10.1016/j.catena.2012.11.013

Alimohammadlou, 2014, Estimation of rainfall-induced landslides using ANN and fuzzy clustering methods: a case study in Saeen Slope, Azerbaijan province, Iran, Catena, 120, 149, 10.1016/j.catena.2014.04.009

Aubouin, 1959, Contribution a l' etude geologique de la Grece septentrionale: les confins de l' Epire et de la Thessalie, Ann. Geol. Pays Hellen., 10, 1

Ayalew, 2005, The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan, Geomorphology, 65, 15, 10.1016/j.geomorph.2004.06.010

Baioni, 2011, Human activity and damaging landslides and floods on Madeira Island, Nat. Hazards Earth Syst. Sci., 11, 3035, 10.5194/nhess-11-3035-2011

Ballabio, 2012, Support vector machines for landslide susceptibility mapping: the Staffora River Basin case study, Italy, Math. Geosci., 44, 47, 10.1007/s11004-011-9379-9

Bijukchhen, 2013, A comparative evaluation of heuristic and bivariate statistical modeling for landslide susceptibility mappings in Ghurmi-Dhad Khola, east Nepal, Arab. J. Geosci., 6, 2727, 10.1007/s12517-012-0569-7

Brenning, 2005, Spatial prediction models for landslide hazards: review, comparison and evaluation, Nat. Hazards Earth Syst. Sci., 5, 853, 10.5194/nhess-5-853-2005

Brunn, 1956, Contribution à l'étude géologique du Pinde septentrional et d'une partie de la Macédoine occidentale

Can, 2005, Susceptibility assessments of shallow earthlows triggered by heavy rainfall at three catchments by logistic regression analysis, Geomorphology, 72, 250, 10.1016/j.geomorph.2005.05.011

Caniani, 2008, Neural networks and landslide susceptibility: a case study of the urban area of Potenza, Nat. Hazards, 45, 55, 10.1007/s11069-007-9169-3

Castellanos Abella, 2007, Generation of a landslide risk index map for Cuba using spatial multi-criteria evaluation, Landslides, 4, 311, 10.1007/s10346-007-0087-y

Cestnik, 1987, Assistant 86: a knowledge-elicitation tool for sophisticated users, 31

Chacon, 2006, Engineering geology maps: landslides and geographical information systems, Bull. Eng. Geol. Environ., 65, 341, 10.1007/s10064-006-0064-z

Champatiray, 2007, Fuzzy-based method for landslide hazard assessment in active seismic zone of Himalaya, Landslides, 4, 101, 10.1007/s10346-006-0068-6

Cheeseman, 1996, Bayesian classification (autoclass): theory and results, 153

Choi, 2010, Validation of an artificial neural network model for landslide susceptibility mapping, Environ. Earth Sci., 60, 473, 10.1007/s12665-009-0188-0

Chung, 2003, Validation of spatial prediction models for landslide hazard mapping, Nat. Hazards, 30, 451, 10.1023/B:NHAZ.0000007172.62651.2b

Conforti, 2014, Evaluation of prediction capability of the artificial neural networks for mapping landslide susceptibility in the Turbolo River catchment (northern Calabria, Italy), Catena, 113, 236, 10.1016/j.catena.2013.08.006

Cooper, 1992, Bayesian method for the induction of probabilistic networks from data, Mach. Learn., 9, 309, 10.1007/BF00994110

Costanzo, 2012, Factors selection in landslide susceptibility modelling on large scale following the GIS matrix method: application to the River Beiro Basin (Spain), Nat. Hazards Earth Syst. Sci., 12, 327, 10.5194/nhess-12-327-2012

Dai, 2002, Landslide risk assessment and management: an overview, Eng. Geol., 64, 65, 10.1016/S0013-7952(01)00093-X

Das, 2010, Landslide susceptibility assessment using logistic regression and its comparison with a rock mass classification system, along a road section in the northern Himalayas (India), Geomorphology, 114, 627, 10.1016/j.geomorph.2009.09.023

Dash, 1997, Feature selection for classification, Intell. Data Anal., 1, 679, 10.1016/S1088-467X(97)00008-5

Domingos, 1997, Beyond independence: conditions for the optimality of the simple Bayesian classifier, Mach. Learn., 29, 103, 10.1023/A:1007413511361

Dormann, 2013, Collinearity: a review of methods to deal with it and a simulation study evaluating their performance, Ecography, 36, 27, 10.1111/j.1600-0587.2012.07348.x

Ercanoglu, 2002, Assessment of landslide susceptibility for a landslide prone area (north of Yenice, NW Turkey) by fuzzy approach, Environ. Geol., 41, 720, 10.1007/s00254-001-0454-2

Ercanoglu, 2004, Use of fuzzy relations to produce landslide susceptibility map of a landslide prone area West Black Sea region, Turkey, Eng. Geol., 75, 229, 10.1016/j.enggeo.2004.06.001

Ermini, 2005, Artificial neural networks applied to landslide susceptibility assessment, Geomorphology, 66, 327, 10.1016/j.geomorph.2004.09.025

ESRI, 2013

Fawcett, 2006, An introduction to ROC analysis, Pattern Recogn. Lett., 27, 861, 10.1016/j.patrec.2005.10.010

Feizizadeh, 2013, GIS-multicriteria decision analysis for landslide susceptibility mapping: comparing three methods for the Urmia lake basin, Iran, Nat. Hazards, 65, 2105, 10.1007/s11069-012-0463-3

Feizizadeh, 2013, Integrating GIS based fuzzy set theory in multicriteria evaluation methods for landslide susceptibility mapping, Int. J. Geoinf., 9, 49

Feizizadeh, 2014, A GIS-based extended fuzzy multi-criteria evaluation for landslide susceptibility mapping, Comput. Geosci., 73, 208, 10.1016/j.cageo.2014.08.001

Felicisimo, 2013, Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study, Landslides, 10, 175, 10.1007/s10346-012-0320-1

Fell, 2008, Guidelines for landslide susceptibility, hazard and risk zoning for land-use planning, Eng. Geol., 102, 99, 10.1016/j.enggeo.2008.03.014

Ferentinou, 2007, Computational intelligence tools for the prediction of slope performance, Comput. Geotech., 34, 362, 10.1016/j.compgeo.2007.06.004

Flentje, 2007, Landslide susceptibility and hazard derived from a landslide inventory using data mining — an Australian case study, 1

Goetz, 2015, Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling, Comput. Geosci., 81, 1, 10.1016/j.cageo.2015.04.007

Gorsevski, 2006, Spatial prediction of landslide hazard using logistic regression and ROC analysis, Trans. GIS, 10, 395, 10.1111/j.1467-9671.2006.01004.x

Guns, 2012, Logistic regression applied to natural hazards: rare event logistic regression with replications, Nat. Hazards Earth Syst. Sci., 12, 1937, 10.5194/nhess-12-1937-2012

Guyon, 2003, An introduction to variable and feature selection, J. Mach. Learn. Res., 3, 1157

Guzzetti, 1999, Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy, Geomorphology, 31, 181, 10.1016/S0169-555X(99)00078-1

Guzzetti, 2005, Probabilistic landslide hazard assessment at the basin scale, Geomorphology, 72, 272, 10.1016/j.geomorph.2005.06.002

Hall, 2009

Halvorson, 2010, 608

Hanley, 1982, The meaning and use of the area under a receiver operating characteristic (ROC) curve, Radiology, 143, 29, 10.1148/radiology.143.1.7063747

Heckmann, 2014, Sample size matters: investigating the effect of sample size on a logistic regression susceptibility model for debris flows, Nat. Hazards Earth Syst. Sci., 14, 259, 10.5194/nhess-14-259-2014

Hong, 2015, Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines, Catena, 133, 266, 10.1016/j.catena.2015.05.019

Hutchinson, 1995, Keynote paper: landslide hazard assessment, 1805

Ilia, 2015, Applying weight of evidence method and sensitivity analysis to produce a landslide susceptibility map, Landslides

Irigaray, 2007, Evaluation and validation of landslide-susceptibility maps obtained by a GIS matrix method: examples from the Betic Cordillera (southern Spain), Nat. Hazards, 41, 61, 10.1007/s11069-006-9027-8

Jain, 2000, Statistical pattern recognition: a review, IEEE Trans. Pattern Anal. Mach. Intell., 22, 4, 10.1109/34.824819

John, 1995, Estimating continuous distributions in Bayesian classifiers, 338

Jones, 2008

Kayastha, 2012, Landslide susceptibility mapping using the weight of evidence method in the Tinau watershed, Nepal, Nat. Hazards, 63, 479, 10.1007/s11069-012-0163-z

Korup, 2014, Landslide prediction from machine learning, Geol. Today, 30, 26, 10.1111/gto.12034

Koukis, 2005, Landslides Hazard Zonation in Greece, 291

Kouli, 2014, Landslide susceptibility mapping by comparing the WLC and WofE multi-criteria methods in the West Crete Island, Greece, Environ. Earth Sci., 72, 5197, 10.1007/s12665-014-3389-0

Lee, 2004, Application of likelihood ratio and logistic regression models to landslide susceptibility mapping using GIS, Environ. Manag., 34, 223, 10.1007/s00267-003-0077-3

Lee, 2007, Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models, Landslides, 4, 33, 10.1007/s10346-006-0047-y

Lee, 2006, Landslide susceptibility mapping in the Damrei Romel area, Cambodia using frequency ratio and logistic regression models, Environ. Geol., 50, 847, 10.1007/s00254-006-0256-7

Lee, 2003, Landslide susceptibility analysis using GIS and artificial neural network, Earth Surf. Process. Landf., 28, 1361, 10.1002/esp.593

Lee, 2004, Determination and application of the weights for landslide susceptibility mapping using an artificial neural network, Eng. Geol., 71, 289, 10.1016/S0013-7952(03)00142-X

Mancini, 2010, GIS and statistical analysis for landslide susceptibility mapping in the Daunia area, Italy, Nat. Hazards Earth Syst. Sci., 10, 1851, 10.5194/nhess-10-1851-2010

Marjanovic, 2011, Landslide susceptibility assessment using SVM machine learning algorithm, Eng. Geol., 123, 225, 10.1016/j.enggeo.2011.09.006

Marquardt, 1970, Generalized inverses, ridge regression, biased linear estimation, and non – linear estimation, Technometrics, 12, 605, 10.2307/1267205

Melchiorre, 2008, Artificial neural networks and cluster analysis in landslide susceptibility zonation, Geomorphology, 94, 379, 10.1016/j.geomorph.2006.10.035

Miner, 2010, A comparative study of various data mining techniques as applied to the modeling of landslide susceptibility on the Bellarine Peninsula, Victoria, Australia, 352

Montgomery, 2001

Murakami, 2010, Applying the Naïve Bayes classifier with kernel density estimation to the prediction of protein-protein interaction sites, Bioinformatics, 26, 1841, 10.1093/bioinformatics/btq302

Muthu, 2008, Landslide possibility mapping using fuzzy approaches, IEEE Trans. Geosci. Remote Sens., 46, 1253, 10.1109/TGRS.2007.912441

Neaupane, 2004, Use of back propagation neural network for landslide monitoring: a case study in the higher Himalaya, Eng. Geol., 74, 213, 10.1016/j.enggeo.2004.03.010

Nefeslioglu, 2008, An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps, Eng. Geol., 97, 171, 10.1016/j.enggeo.2008.01.004

Nefeslioglu, 2010, Assessment of landslide susceptibility by decision trees in the metropolitan area of Istanbul, Turkey, Math. Probl. Eng., 10.1155/2010/901095

Negnevitsky, 2002, 394

Ng, 2001, On discriminative vs. generative classifiers: a comparison of logistic regression and naive Bayes, 841

O'brien, 2007, A caution regarding rules of thumb for variance inflation factors, Qual. Quant., 41, 673, 10.1007/s11135-006-9018-6

Oh, 2010, Cross-validation of logistic regression model for landslide susceptibility mapping at Ganeoung areas, Korea, Disaster Adv., 3, 44

Oh, 2011, Landslide susceptibility mapping on Panaon Island, Philippines using a geographic information system, Environ. Earth Sci., 62, 935, 10.1007/s12665-010-0579-2

Oh, 2011, Application of a neuro-fuzzy model to landslide susceptibility mapping in a tropical hilly area, Comput. Geosci., 37, 1264, 10.1016/j.cageo.2010.10.012

Poudyal, 2010, Landslide susceptibility maps comparing frequency ratio and artificial neural networks: a case study from the Nepal Himalaya, Environ. Earth Sci., 61, 1049, 10.1007/s12665-009-0426-5

Pourghasemi, 2012, Application of fuzzy logic and analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz watershed, Iran, Nat. Hazards, 63, 965, 10.1007/s11069-012-0217-2

Pourghasemi, 2012, Landslide susceptibility mapping using index of entropy and conditional probability models in GIS: Safarood Basin, Iran, Catena, 97, 71, 10.1016/j.catena.2012.05.005

Pourghasemi, 2013, Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances, Nat. Hazards, 69, 749, 10.1007/s11069-013-0728-5

Pourghasemi, 2013, Landslide susceptibility mapping using support vector machine and GIS at the Golestan Province, Iran, J. Earth Syst. Sci., 122, 349, 10.1007/s12040-013-0282-2

Pradhan, 2010, Application of an advanced fuzzy logic model for landslide susceptibility analysis, Int. J. Comput. Intell. Syst., 3, 370, 10.1080/18756891.2010.9727707

Pradhan, 2011, Use of GIS-based fuzzy logic relations and its cross application to produce landslide susceptibility maps in three test areas in Malaysia, Environ. Earth Sci., 63, 329, 10.1007/s12665-010-0705-1

Pradhan, 2011, Manifestation of an advanced fuzzy logic model coupled with geoinformation techniques for landslide susceptibility analysis, Environ. Ecol. Stat., 18, 471, 10.1007/s10651-010-0147-7

Pradhan, 2013, A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS, Comput. Geosci., 51, 350, 10.1016/j.cageo.2012.08.023

Pradhan, 2010, Delineation of landslide hazard areas on Penang Island, Malaysia, by using frequency ratio, logistic regression, and artificial neural network models, Environ. Earth Sci., 60, 1037, 10.1007/s12665-009-0245-8

Pradhan, 2010, Landslide susceptibility assessment and factor effect analysis: back-propagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling, Environ. Model. Softw., 25, 747, 10.1016/j.envsoft.2009.10.016

Pradhan, 2010, Regional landslide susceptibility analysis using back – propagation neural network model at Cameron highland, Malaysia, Landslides, 7, 13, 10.1007/s10346-009-0183-2

Pradhan, 2009, Use of geospatial data for the development of fuzzy algebraic operators to landslide hazard mapping: a case study in Malaysia, Appl. Geomatics, 1, 3, 10.1007/s12518-009-0001-5

Pradhan, 2010, A GIS-based back-propagation neural network model and its cross application and validation for landslide susceptibility analyses, Comput. Environ. Urban. Syst., 34, 216, 10.1016/j.compenvurbsys.2009.12.004

Pradhan, 2010, Landslide susceptibility mapping by neuro-fuzzy approach in a landslide-prone area (Cameron Highlands, Malaysia), IEEE Trans. Geosci. Remote Sens., 48, 4164, 10.1109/TGRS.2010.2050328

Regmi, 2010, Modeling susceptibility to landslides using the weight of evidence approach: Western Colorado, USA, Geomorphology, 115, 172, 10.1016/j.geomorph.2009.10.002

Regmi, 2010, Assessing susceptibility to landslides: using models to understand observed changes in slopes, Geomorphology, 122, 25, 10.1016/j.geomorph.2010.05.009

Regmi, 2014, A comparison of logistic regression-based models of susceptibility to landslides in western Colorado, USA, Landslides, 11, 247, 10.1007/s10346-012-0380-2

Rozos, 2008, An implementation of rock engineering system for ranking the instability potential of natural slopes in Greek territory: an application in Karditsa County, Landslides, 5, 261, 10.1007/s10346-008-0117-4

Sabatakakis, 2013, Landslide susceptibility zonation in Greece, Nat. Hazards, 65, 523, 10.1007/s11069-012-0381-4

Saito, 2009, Comparison of landslide susceptibility based on a decision-tree model and actual landslide occurrence: the Akaishi mountains, Japan, Geomorphology, 109, 108, 10.1016/j.geomorph.2009.02.026

Sdao, 2013, Landslide susceptibility assessment by using a neuro-fuzzy model: a case study in the rupestrian heritage rich area of Matera, Nat. Hazards Earth Syst. Sci., 13, 395, 10.5194/nhess-13-395-2013

Sezer, 2011, Manifestation of an adaptive neuro-fuzzy model on landslide susceptibility mapping: Klang valley, Malaysia, Expert Syst. Appl., 38, 8208, 10.1016/j.eswa.2010.12.167

Shahabi, 2014, Landslide susceptibility mapping at central Zab basin, Iran: a comparison between analytical hierarchy process, frequency ratio and logistic regression models, Catena, 115, 55, 10.1016/j.catena.2013.11.014

Soeters, 1996, Slope instability recognition, analysis, and zonation, vol. 247

Soria, 2011, A “non-parametric” version of the naive Bayes classifier, Knowl.-Based Syst., 24, 775, 10.1016/j.knosys.2011.02.014

SPSS, Inc. Released, 2007

Suzen, 2011, Evaluation of environmental parameters in logistic regression models for landslide susceptibility mapping, Int. J. Digit. Earth, 5, 338, 10.1080/17538947.2011.586443

Tien Bui, 2012, Landslide susceptibility assessment in the Hoa Binh province of Vietnam using artificial neural network, Geomorphology, 171-172, 12, 10.1016/j.geomorph.2012.04.023

Tien Bui, 2012, Spatial prediction of landslide hazards in Vietnam: a comparative assessment of the efficacy of evidential belief functions and fuzzy logic models, Catena, 96, 28, 10.1016/j.catena.2012.04.001

Tien Bui, 2012, Landslide Susceptibility Assessment in Vietnam Using Support Vector Machines, Decision Tree, and Naïve Bayes Models, vol. 2012

Tien Bui, 2015, Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree, Landslides

Tien Bui, 2015, Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree, Landslides

Tsangaratos, 2014, Estimating landslide susceptibility through an artificial neural network classifier, Nat. Hazards, 74, 1489, 10.1007/s11069-014-1245-x

Tsangaratos, 2015, Landslide susceptibility mapping using a modified decision tree classifier in the Xanthi Prefecture, Greece, Landslides

Tsangaratos, 2013, Case event system for landslide susceptibility analysis, 585

Vahidnia, 2010, A GIS-based neuro-fuzzy procedure for integrating knowledgeand data in landslide susceptibility mapping, Comput. Geosci., 36, 1101, 10.1016/j.cageo.2010.04.004

Van Den Eeckhaut, 2006, Prediction of landslide susceptibility using rare events logistic regression: a case-study in the Flemish Ardennes (Belgium), Geomorphology, 76, 392, 10.1016/j.geomorph.2005.12.003

Van Den Eeckhaut, 2010, Comparison of two landslide susceptibility assessments in the Champagne–Ardenne region (France), Geomorphology, 115, 141, 10.1016/j.geomorph.2009.09.042

Van Westen, 2006, Landslide hazard and risk zonation — why is still so difficult?, Bull. Eng. Geol. Environ., 65, 167, 10.1007/s10064-005-0023-0

Varnes, 1984

Wang, 2013, Landslide susceptibility analysis with logistic regression model based on FCM sampling strategy, Comput. Geosci., 57, 81, 10.1016/j.cageo.2013.04.006

Weisberg, 2010

Xu, 2012, GIS-based support vector machine modeling of earthquake-triggered landslide susceptibility in the Jianjiang River watershed, China, Geomorphology, 145-146, 70, 10.1016/j.geomorph.2011.12.040

Yalcin, 2011, A GIS-based comparative study of frequency ratio, analytical hierarchy process, bivariate statistics and logistics regression methods for landslide susceptibility mapping in Trabzon, NE Turkey, Catena, 85, 274, 10.1016/j.catena.2011.01.014

Yao, 2008, Landslide susceptibility mapping based on support vector machine: a case study on natural slopes of Hong Kong, China, Geomorphology, 101, 572, 10.1016/j.geomorph.2008.02.011

Yeon, 2010, Landslide susceptibility mapping in Injae, Korea, using a decision tree, Eng. Geol., 16, 274, 10.1016/j.enggeo.2010.09.009

Yesilnacar, 2005, Landslide susceptibility mapping: a comparison of logistic regression and neural networks methods in a medium scale study, Hendek region (Turkey), Eng. Geol., 79, 251, 10.1016/j.enggeo.2005.02.002

Yilmaz, 2010, Comparison of landslide susceptibility mapping methodologies for Koyulhisar, Turkey: conditional probability, logistic regression, artificial neural networks, and support vector machine, Environ. Earth Sci., 61, 821, 10.1007/s12665-009-0394-9

Zare, 2013, Landslide susceptibility mapping at Vaz Watershed (Iran) using an artificial neural network model: a comparison between multilayer perceptron (MLP) and radial basic function (RBF) algorithms, Arab. J. Geosci., 6, 2873, 10.1007/s12517-012-0610-x

Zhu, 2014, An expert knowledge-based approach to landslide susceptibility mapping using GIS and fuzzy logic, Geomorphology, 214, 128, 10.1016/j.geomorph.2014.02.003