Abstract

Plant species diversity (PSD) has always been an essential component of biodiversity and plays an important role in ecosystem functions and services. However, it is still a huge challenge to simulate the spatial distribution of PSD due to the difficulties of data acquisition and unsatisfactory performance of predicting algorithms over large areas. A surge in the number of remote sensing imagery, along with the great success of machine learning, opens new opportunities for the mapping of PSD. Therefore, different machine learning algorithms combined with high-accuracy surface modeling (HASM) were firstly proposed to predict the PSD in the Xinghai, northeastern Qinghai-Tibetan Plateau, China. Spectral reflectance and vegetation indices, generated from Landsat 8 images, and environmental variables were taken as the potential explanatory factors of machine learning models including least absolute shrinkage and selection operator (Lasso), ridge regression (Ridge), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF). The prediction generated from these machine learning methods and in situ observation data were integrated by using HASM for the high-accuracy mapping of PSD including three species diversity indices. The results showed that PSD was closely associated with vegetation indices, followed by spectral reflectance and environmental factors. XGBoost combined with HASM (HASM-XGBoost) showed the best performance with the lowest MAE and RMSE. Our results suggested that the fusion of heterogeneous data and the ensemble of heterogeneous models may revolutionize our ability to predict the PSD over large areas, especially in some places limited by sparse field samples.

Highlights

  • Plant species diversity (PSD) is an essential component of biodiversity and composed of species richness and evenness (Mcintosh and Odum 1969)

  • The objectives of this paper were as follows: (1) explore and identify the factors that have a great impact on the spatial distribution of PSD; (2) demonstrate the feasibility of our proposed ensemble learning models for the mapping of PSD in a large area with sparse data, especially the places that are hard and costly to reach for human beings; (3) map PSD using eight different machine learning algorithms combined with high-accuracy surface modeling (HASM) fused in-situ observations and remote sensing images in the first time

  • Sensed variables and environmental factors were fused to predict the distribution of PSD by using machine learning algorithms combined with HASM

Read more

Summary

Introduction

Plant species diversity (PSD) is an essential component of biodiversity and composed of species richness and evenness (Mcintosh and Odum 1969). Richness takes into account individual species, while evenness represents the relative abundance of species. As an important indicator of the abundance of biological resource in habitats, and has a huge effect on ecosystem functions (Cardinale et al 2012), and ecosystem services (Dong et al 2020; Fauvel et al 2020; Liu et al 2018b). The mapping of PSD, has drawn much attention (Aggemyr et al 2018; Schuler et al 2019; Wan et al 2020). It is necessary and urgent to develop a novel model to estimate the current state of diversity, which is essential for the government planning and management

Objectives
Methods
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call