Abstract

Remote sensing based biomass estimation in arid environments is essential for monitoring degradation and carbon dynamics. However, due to the low vegetation cover in these regions, satellite-based research is challenging. Numerous potentially useful remotely-sensed predictor variables have been proposed, and several statistical and machine-learning techniques are available for empirical spatial modeling, but their predictive performance is yet unknown in this context. We therefore modeled total biomass in the Eastern Pamirs of Tajikistan, a region with extremely low vegetation cover, with a large set of satellite based predictors derived from two commonly used sensors (Landsat OLI, RapidEye), and assessed their utility in this environment using several suitable modeling approaches (stepwise, lasso, partial least squares and ridge regression, random forest). The best performing model (lasso regression) resulted in a RMSE of 992kgha−1 in spatial cross-validation, indicating that biomass quantification in this arid setting is feasible but subject to large uncertainties. Furthermore, pronounced over-fitting in some commonly used models (e.g. stepwise regression, random forest) underlined the importance of adequate variable selection and shrinkage techniques in spatial modeling of high dimensional data. The applied sensors showed very similar performance and a combination of both only slightly improved results of better performing models. A permutation-based assessment of variable importance showed that some of the most frequently used vegetation indices are not suitable for dwarf shrub biomass prediction in this environment. We suggest that predictor variables based on several bands accounting for vegetation as well as background information are required in this arid setting.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call