Two alternative evaluation metrics to replace the true skill statistic in the assessment of species distribution models

Rainer Ferdinand Wunderlich,Johnathen Anthony,Joy R Petway,Yu-Pin Lin

doi:10.3897/natureconservation.35.33918

Rainer Ferdinand Wunderlich, Johnathen Anthony + Show 2 more

Open Access

https://doi.org/10.3897/natureconservation.35.33918

Copy DOI

Journal: Nature Conservation	Publication Date: Jun 20, 2019
Citations: 37	License type: CC BY 4.0

Affiliation: National Taiwan University

Abstract

Model evaluation metrics play a critical role in the selection of adequate species distribution models for conservation and for any application of species distribution modelling (SDM) in general. The responses of these metrics to modelling conditions, however, are rarely taken into account. This leads to inadequate model selection, downstream analyses and uniformed decisions. To aid modellers in critically assessing modelling conditions when choosing and interpreting model evaluation metrics, we analysed the responses of the True Skill Statistic (TSS) under a variety of presence-background modelling conditions using purely theoretical scenarios. We then compared these responses with those of two evaluation metrics commonly applied in the field of meteorology which have potential for use in SDM: the Odds Ratio Skill Score (ORSS) and the Symmetric Extremal Dependence Index (SEDI). We demonstrate that (1) large cell number totals in the confusion matrix, which is strongly biased towards ‘true’ absences in presence-background SDM and (2) low prevalence both compromise model evaluation with TSS. This is since (1) TSS fails to differentiate useful from random models at extreme prevalence levels if the confusion matrix cell number total exceeds ~30,000 cells and (2) TSS converges to hit rate (sensitivity) when prevalence is lower than ~2.5%. We conclude that SEDI is optimal for most presence-background SDM initiatives. Further, ORSS may provide a better alternative if absence data are available or if equal error weighting is strictly required.

Highlights

Species Distribution Modelling (SDM) relates independent environmental variables to species occurrence data and, in turn, predicts a dependent variable such as probability or the relative likelihood of occurrence (Guisan and Zimmermann 2000; Peterson 2001; Guillera-Arroita et al 2015)
We have shown that True Skill Statistic (TSS), Odds Ratio Skill Score (ORSS) and Symmetric Extremal Dependence Index (SEDI), as well as their underlying evaluation measures (H and F, see F in Table 2), show distinct responses to: 1) increasing size of the study area and, growing numbers of background points, even when prevalence is kept constant, 2) to the direction of bias as prevalence decreases and the extent of the study area and cell number totals increase and 3) to changes in bias as prevalence decreases and the extent of the study area and cell number totals increase
We focused on the importance of model evaluation in the context of ecology and conservation

Summary

Introduction

Species Distribution Modelling (SDM) relates independent environmental variables to species occurrence data and, in turn, predicts a dependent variable such as probability or the relative likelihood of occurrence (Guisan and Zimmermann 2000; Peterson 2001; Guillera-Arroita et al 2015). Even though SDM predictions mostly range from zero to one, SDM predictions are often discretised into binary presence-absence maps (i.e. comprising only zeros and ones) used to evaluate wildlife management options, to identify appropriate conservation translocation sites and to evaluate model performance (Willis et al 2009; Fordham et al 2012; Liu et al 2013) with confusion matrix-based performance metrics. ‘Observed false absences’, on the other hand, are artefactual in nature, resulting from insufficient monitoring relative to species movement (Tyre et al 2003) or imperfect detection (MacKenzie et al 2002) Whereas both true and false absences can lead to ‘zero-inflated’ datasets (Heilbron 1994) that violate statistical assumptions, the latter are a source of uncertainty in parameter estimates as artefactual signals (e.g. sampling bias, probability of detection) confounding estimates of probability of occurrence (MacKenzie et al 2002)

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Two alternative evaluation metrics to replace the true skill statistic in the assessment of species distribution models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature Conservation

Lead the way for us

Similar Papers

The interplay of various sources of noise on reliability of species distribution models hinges on ecological specialisation.
Alaaeldin Soultan ... Kamran Safi
PLOS ONE | VOL. 12
Alaaeldin Soultan, et. al.Alaaeldin Soultan ... Kamran Safi
13 Nov 2017
PLOS ONE | VOL. 12

Using species distribution models at local scale to guide the search of poorly known species: Review, methodological issues and future directions
Mauro Fois ... Gianluigi Bacchetta
Ecological Modelling | VOL. 385
Mauro Fois, et. al.Mauro Fois ... Gianluigi Bacchetta
01 Aug 2018
Ecological Modelling | VOL. 385

Characterising social-ecological drivers of landuse/cover change in a complex transboundary basin using singular or ensemble machine learning
Blessing Kavhu ... Linda Luvuno
Remote Sensing Applications: Society and Environment | VOL. 27
Blessing Kavhu, et. al.Blessing Kavhu ... Linda Luvuno
10 May 2022
Remote Sensing Applications: Society and Environment | VOL. 27

Small-scale distribution modeling of benthic species in a protected natural hard ground area in the German North Sea (Helgoländer Steingrund)
Lydia R Becker ... Kai Bischof
Geo-Marine Letters | VOL. 40
Lydia R Becker, et. al.Lydia R Becker ... Kai Bischof
24 Oct 2019
Geo-Marine Letters | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two alternative evaluation metrics to replace the true skill statistic in the assessment of species distribution models

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature Conservation