Abstract

Machine learning algorithms are often used to model and predict animal habitat selection—the relationships between animal occurrences and habitat characteristics. For broadly distributed species, habitat selection often varies among populations and regions; thus, it would seem preferable to fit region- or population-specific models of habitat selection for more accurate inference and prediction, rather than fitting large-scale models using pooled data. However, where the aim is to make range-wide predictions, including areas for which there are no existing data or models of habitat selection, how can regional models best be combined? We propose that ensemble approaches commonly used to combine different algorithms for a single region can be reframed, treating regional habitat selection models as the candidate models. By doing so, we can incorporate regional variation when fitting predictive models of animal habitat selection across large ranges. We test this approach using satellite telemetry data from 168 humpback whales across five geographic regions in the Southern Ocean. Using random forests, we fitted a large-scale model relating humpback whale locations, versus background locations, to 10 environmental covariates, and made a circumpolar prediction of humpback whale habitat selection. We also fitted five regional models, the predictions of which we used as input features for four ensemble approaches: an unweighted ensemble, an ensemble weighted by environmental similarity in each cell, stacked generalization, and a hybrid approach wherein the environmental covariates and regional predictions were used as input features in a new model. We tested the predictive performance of these approaches on an independent validation dataset of humpback whale sightings and whaling catches. These multiregional ensemble approaches resulted in models with higher predictive performance than the circumpolar naive model. These approaches can be used to incorporate regional variation in animal habitat selection when fitting range-wide predictive models using machine learning algorithms. This can yield more accurate predictions across regions or populations of animals that may show variation in habitat selection.

Highlights

  • Modelling the habitat selection of animals—that is, estimating functions of their disproportionate use of habitats characterized by a set of covariates—is a frequent task in animal ecology, to address both fundamental and applied questions [1,2]

  • To estimate the habitat selection of humpback whales, we used a case–control design [17], wherein we modelled the environmental characteristics of the locations where whales were present compared to the environmental characteristics of locations that whales could potentially have used

  • In our case study using satellite telemetry data for humpback whales across the Southern Ocean, we show that three approaches lead to more accurate predictions of an independent validation dataset of humpback whale catches and sightings, compared to the naive circumpolar model

Read more

Summary

Introduction

Two habitat selection models fitted using satellite tracking data for grey petrels (Procellaria cinerea) from two Southern Ocean archipelagos performed poorly when predicted between the two archipelagos, and when validated with tracking data from a third archipelago [8]. In this case, the poor transferability of the two models indicated that they were not generalizable (the models did not extrapolate well across sites), even though the habitat selection of petrels from the two archipelagos was broadly similar [8]

Objectives
Methods
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call