Abstract

The estimation and mapping of forest stand characteristics are vital because this information is necessary for sustainable forest management. The present study considers the use of a Bayesian additive regression trees (BART) algorithm as a non-parametric classifier using Sentinel-2A data and topographic variables to estimate the forest stand characteristics, namely the basal area (m2/ha), stem volume (m3/ha), and stem density (number/ha). These results were compared with those of three other popular machine learning (ML) algorithms, such as generalised linear model (GLM), K-nearest neighbours (KNN), and support vector machine (SVM). A feature selection was done on 28 variables including the multi-spectral bands on Sentinel-2 satellite, related vegetation indices, and ancillary data (elevation, slope, and topographic solar-radiation index derived from digital elevation model (DEM)) and then the most insignificant variables were removed from the datasets by recursive feature elimination (RFE). The study area was a mountainous forest with high biodiversity and an elevation gradient from 26 to 1636 m. An inventory dataset of 1200 sample plots was provided for training and testing the algorithms, and the predictors were fed into the ML models to compute and predict the forest stand characteristics. The accuracies and certainties of the ML models were assessed by their root mean square error (RMSE), mean absolute error (MAE), and R-squared (R2) values. The results demonstrated that BART generated the best basal area and stem volume predictions, followed by GLM, SVM, and KNN. The best RMSE values for both basal area (8.12 m2/ha) and stem volume (29.28 m3/ha) estimation were obtained by BART. Thus, the ability of the BART model for forestry application was established. On the other hand, KNN exhibited the highest RMSE values for all stand variable predictions, thereby exhibiting the least accuracy for this specific application. Moreover, the effectiveness of the narrow Sentinel-2 bands around the red edge and elevation was highlighted for predicting the forest stand characteristics. Therefore, we concluded that the combination of the Sentinel-2 products and topographic variables derived from the PALSAR data used in this study improved the estimation of the forest attributes in temperate forests.

Highlights

  • Forests cover around 30% of the Earth’s surface and are one of the main sources of human supplies and services [1]; sustainable development, climate change mitigation, and bio-diversity preservation can be achieved by forestry and forest management [2]

  • To determine the basal area, B5, B6, B8A, B11, B12, inverted red-edge chlorophyll index (IRECI), normalised difference index 45 (NDI45), pigment specific simple ratio (PSSRA), elevation, slope, and TRASP were selected by recursive feature elimination (RFE)

  • We compared the differences between the predicted and observed values using the mean absolute error (MAE): the lowest MAE was achieved by Bayesian additive regression trees (BART) for the basal area (6.88 m2/ha) and stem volume (24.53 m3/ha) estimations, whereas the best MAE for stem density estimation was achieved by generalised linear model (GLM) (43.87 n/ha)

Read more

Summary

Introduction

Forests cover around 30% of the Earth’s surface and are one of the main sources of human supplies and services [1]; sustainable development, climate change mitigation, and bio-diversity preservation can be achieved by forestry and forest management [2]. Continuous forest management is crucial, which requires precise knowledge of forest characteristics through detailed information extraction [3]. Spatiotemporal change detection, logging, and evaluating the forest management regime in the region can bring more transparency into the management and ecosystem services in the forest [4]. The Sentinel satellites continuously map and monitor vast forest regions using high spatial, spectral and temporal resolution data, but at low costs [5,8]. The operation of the Sentinel-2 satellite provides multi-spectral data in 13 bands, with a spatial resolution of 10 to 60 m, 10-day revisiting period, and 290 km swath width

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call