Abstract
Machine learning has been employed for various mapping and modeling tasks using input variables from different sources of remote sensing data. For feature selection involving high- spatial and spectral dimensionality data, various methods have been developed and incorporated into the machine learning framework to ensure an efficient and optimal computational process. This research aims to assess the accuracy of various feature selection and machine learning methods for estimating forest height using AISA (airborne imaging spectrometer for applications) hyperspectral bands (479 bands) and airborne light detection and ranging (lidar) height metrics (36 metrics), alone and combined. Feature selection and dimensionality reduction using Boruta (BO), principal component analysis (PCA), simulated annealing (SA), and genetic algorithm (GA) in combination with machine learning algorithms such as multivariate adaptive regression spline (MARS), extra trees (ET), support vector regression (SVR) with radial basis function, and extreme gradient boosting (XGB) with trees (XGbtree and XGBdart) and linear (XGBlin) classifiers were evaluated. The results demonstrated that the combinations of BO-XGBdart and BO-SVR delivered the best model performance for estimating tropical forest height by combining lidar and hyperspectral data, with R2 = 0.53 and RMSE = 1.7 m (18.4% of nRMSE and 0.046 m of bias) for BO-XGBdart and R2 = 0.51 and RMSE = 1.8 m (15.8% of nRMSE and −0.244 m of bias) for BO-SVR. Our study also demonstrated the effectiveness of BO for variables selection; it could reduce 95% of the data to select the 29 most important variables from the initial 516 variables from lidar metrics and hyperspectral data.
Highlights
Forest structural properties are critical information sources for measuring and monitoring aboveground biomass (AGB), which is used to predict the amount of carbon stock in forest stands [1,2,3]
This study explored the combination of lidar and hyperspectral data using different combinations of variable selection methods and machine learning algorithms to model forest heights in Robson Creek, Australia
72 machine learning models were generated from different input variables using different variable selection methods, and a wide range of model performance was obtained
Summary
Forest structural properties are critical information sources for measuring and monitoring aboveground biomass (AGB), which is used to predict the amount of carbon stock in forest stands [1,2,3]. The UNFCC has set up three levels of accuracy for mapping carbon emissions, from national to regional and global scales, where remote sensing has been recognized as one of the promising technologies for mapping these features. The development of remote sensing technologies that can map broader areas from regional (1000’s km2) to global (108 s km2) scales at high levels of detail has been widely explored for mapping and modeling forest structural properties. Mapping forest structural properties using remote sensing is still a challenging task, especially when dealing with complex forest environments; further assessment related to the uncertainties of using remote sensing methods for mapping forest structural properties is needed [10]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.