Abstract

Variable contribution estimation for, and determination of variable importance within, ecological niche models (ENMs) remain an important area of research with continuing challenges. Most ENM algorithms provide normally exhaustive searches through variable space; however, selecting variables to include in models is a first challenge. The estimation of the explanatory power of variables and the selection of the most appropriate variable set within models can be a second challenge. Although some ENMs incorporate the variable selection rubric inside the algorithms, there is no integrated rubric to evaluate the variable importance in the Genetic Algorithm for Ruleset Production (GARP). Here, we designed a novel variable selection methodology based on the rulesets generated from a GARP experiment. The importance of the variables in a GARP experiment can be estimated based on the consideration of the prevalence of each environmental variable in the dominant presence rules of the best subset of models and its coverage. We tested the performance of this variable selection method based on simulated species with both weak and strong responses to simulated environmental covariates. The variable selection method generally performed well during the simulations with over 2/3 of the trials correctly identifying most covariates. We then predict the distribution of Toxostoma rufum (a bird with a cosmopolitan distribution) in the continental United States (US) and apply our variable selection procedure as a real-world example. We found that the distribution of T. rufum could be accurately modeled with 13 or 10 of 21 variables, using an UI cutoff of 0.5 or 0.25, respectively, arriving at parsimonious environmental coverages with good model accuracy. We also provide tools to simulate species distributions for testing ENM approaches using R.

Highlights

  • Species distribution models (SDMs; i.e. ecological niche models [ENMs]) have been widely applied in ecology, biogeography, conservation biology, evolution, and epidemiology over the past several decades (Larson et al, 2010; Ostfeld et al, 2005; Pearson and Dawson, 2003; Peterson and Vieglais, 2001)

  • We found that Unimportance Index (UI) and Genetic Algorithm for Ruleset Production (GARP) performed well during the simulations

  • We present a new variable selection rubric for GARP based on prevalence rates and median ranges of the variables in the dominant presence rules in best subsets

Read more

Summary

Introduction

Species distribution models (SDMs; i.e. ecological niche models [ENMs]) have been widely applied in ecology, biogeography, conservation biology, evolution, and epidemiology over the past several decades (Larson et al, 2010; Ostfeld et al, 2005; Pearson and Dawson, 2003; Peterson and Vieglais, 2001). Modeling a species’ geographic distribution relies on some form of pattern-recognition based on non-random association between the geographic occurrences of a species and environmental conditions that support its survival under the ecological niche theory (Araujo and Guisan, 2006; Hutchinson, 1957). Species’ distributions and their environmental requirements can be veiled or misleading due to the selection of inappropriate predictors (Araujo and Guisan, 2006).

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.