Making better Maxent models of species distributions: complexity, overfitting and evaluation

Journal of Biogeography - Tập 41 Số 4 - Trang 629-643 - 2014
Aleksandar Radosavljević1, Robert P. Anderson1,2,3
1Department of Biology, City College of the City University of New York, New York, NY 10031 USA
2Division of Vertebrate Zoology (Mammalogy), American Museum of Natural History, New York, NY 10024, USA
3Graduate Center of the City University of New York, New York, NY, 10016 USA

Tóm tắt

AbstractAim

Models of species niches and distributions have become invaluable to biogeographers over the past decade, yet several outstanding methodological issues remain. Here we address three critical ones: selecting appropriate evaluation data, detecting overfitting, and tuning program settings to approximate optimal model complexity. We integrate solutions to these issues for Maxent models, using the Caribbean spiny pocket mouse, Heteromys anomalus, as an example.

Location

North‐western South America.

Methods

We partitioned data into calibration and evaluation datasets via three variations of k‐fold cross‐validation: randomly partitioned, geographically structured and masked geographically structured (which restricts background data to regions corresponding to calibration localities). Then, we carried out tuning experiments by varying the level of regularization, which controls model complexity. Finally, we gauged performance by quantifying discriminatory ability and overfitting, as well as via visual inspections of maps of the predictions in geography.

Results

Performance varied among data‐partitioning approaches and among regularization multipliers. The randomly partitioned approach inflated estimates of model performance and the geographically structured approach showed high overfitting. In contrast, the masked geographically structured approach allowed selection of high‐performing models based on all criteria. Discriminatory ability showed a slight peak in performance around the default regularization multiplier. However, regularization levels two to four times higher than the default yielded substantially lower overfitting. Visual inspection of maps of model predictions coincided with the quantitative evaluations.

Main conclusions

Species‐specific tuning of model parameters can improve the performance of Maxent models. Further, accurate estimates of model performance and overfitting depend on using independent evaluation data. These strategies for model evaluation may be useful for other modelling methods as well.

Từ khóa


Tài liệu tham khảo

10.1206/0003-0082(2003)396<0001:TDANHO>2.0.CO;2

10.1046/j.1365-2699.2003.00867.x

10.1111/j.1749-6632.2011.06440.x

10.1111/nyas.12264

10.1016/j.ecolmodel.2011.04.011

10.1206/582-2.1

10.1111/j.1365-2699.2010.02290.x

10.1046/j.1466-822X.2002.00275.x

Anderson R.P., 2002, Using niche‐based GIS modeling to test geographic predictions of competitive exclusion and competitive release in South American pocket mice, Oikos, 11, 131

10.1111/j.1365-2699.2006.01584.x

10.1126/science.1131758

10.1111/j.1365-2486.2005.01000.x

10.1111/j.1466-822X.2005.00182.x

10.1111/j.1600-0706.2012.00299.x

10.1111/j.1600-0587.2010.06181.x

10.1016/j.ecolmodel.2011.02.011

10.1371/journal.pbio.1000385

10.1016/S0304-3800(02)00200-4

10.1111/j.2006.0906-7590.04596.x

10.1111/j.2041-210X.2010.00036.x

10.1111/j.1472-4642.2010.00725.x

10.1111/j.1461-0248.2005.00792.x

10.1890/06-0539

10.1111/j.0906-7590.2006.04700.x

10.1111/j.1365-2699.2004.01163.x

10.1890/11-0826.1

10.1111/j.1365-2486.2006.01256.x

10.1111/j.0030-1299.2008.16434.x

Huber O., 1997, Vertebrados acutales y fosiles de Venezeula, 279

10.1577/T03-172.1

10.1644/08-MAMM-A-243.1

10.1007/s10530-011-9963-4

10.1016/j.tree.2008.02.001

10.1016/S0304-3800(02)00195-3

10.1111/j.1466-8238.2007.00358.x

10.1111/j.1365-2699.2007.01779.x

10.1111/j.2006.0030-1299.15050.x

10.1111/j.1466-8238.2009.00476.x

10.1016/S0304-3800(02)00198-9

10.1111/j.1472-4642.2007.00344.x

10.1046/j.1466-822X.2003.00042.x

10.1111/j.1365-2699.2006.01594.x

10.1038/nclimate1858

10.1086/378926

10.17161/bi.v3i0.29

10.1111/j.0906-7590.2007.05102.x

10.1016/j.ecolmodel.2007.11.008

10.23943/princeton/9780691136868.001.0001

10.1111/j.0906-7590.2008.5378.x

10.1111/j.0906-7590.2008.5203.x

10.1890/09-0760.1

10.1016/j.ecolmodel.2005.03.026

10.1890/07-2153.1

10.1111/j.1365-2699.2006.01466.x

10.1080/10635150701775111

10.1046/j.1365-2699.2003.00946.x

10.1111/j.1541-0420.2012.01824.x

10.1016/j.ecolmodel.2013.08.011

10.1111/j.0906-7590.2004.03673.x

10.1111/j.1365-2699.2009.02174.x

10.1890/10-1171.1

10.1890/070037

10.1111/j.1442-9993.2005.01514.x

10.1111/j.1472-4642.2008.00482.x