Abstract

Seafloor habitat maps are an important management tool used to delineate distinct regions of the seabed based on their biophysical properties. Spatially continuous bathymetry and backscatter-derived terrain features are commonly used as proxies for environmental conditions and processes that affect species distributions. Multi-scale approaches are increasingly applied to assess the relevant scales at which species co-occur. As the optimal scale(s) may be unknown, features can be calculated at multiple successive scales, yet this results in numerous highly correlated features that may negatively impact model interpretability. To address this increased dimensionality, feature selection approaches can be used to identify the most relevant features. Here, filter and wrapper approaches are assessed to select features from a highly dimensional multi-scale dataset. Terrain features describing the seabed were calculated across ten scales at two coastal sites in Placentia Bay, Newfoundland, Canada. Five species assemblages were identified using ground-truth underwater video sampling. Features predicting the presence of assemblages were assessed using the two selection methods, and the set of chosen features was modelled using three machine learning algorithms: extreme gradient boosting (XGB), random forest (RF), and support vector machines (SVM). The XGB model with features selected by scale-factor from the Boruta wrapper algorithm had the highest accuracy according to cross-validation- (61.67%, kappa 0.49). Bathymetry and terrain attributes were the most important predictors of assemblage occurrence across various analysis scales encompassing both broader and fine-scale variability of the seabed. The proposed feature reduction and selection approach improved the overall accuracy of predictions, and the resulting biological complexity captured in our habitat maps established baseline data for an ecologically significant coastal region.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call