Estimating the total nitrogen content of Aquilaria sinensis leaves based on a hybrid feature selection algorithm and image data from a modified digital camera

Zhulin Chen, Xuefeng Wang, Shanshan Sun

doi:10.1016/j.biosystemseng.2021.11.021

Abstract

With the development of imaging devices and image processing algorithms, numerous features have come to be used for the estimation of total nitrogen content (TNC) in plants. However, higher-dimensional inputs contain more correlated variables, which can degrade model performance. In this study, a hybrid feature selection approach was developed for TNC estimation in Aquilaria sinensis. A low-cost modified digital camera with external filters was used to capture canopy images. Three feature selection methods, namely random forest (RF), Pearson correlation coefficient (PCC)-based feature selection, and sequential backward selection (SBS), were combined into two hybrid feature selection algorithms (RF_SBS and PCC_SBS). In addition, three regression algorithms were used in the hybrid feature selection process: random forest regression (RFR), support vector regression (SVR), and partial least squares regression (PLSR). The hybrid feature selection process consists of two steps. First, the lowest number of dimensions is sought based on the feature ranking. Then, SBS is used to find the best feature combination. Compared with the original models, the R² values of the RF_SBS-based models are improved by 0.094 (RF_SBS_RFR), 0.190 (RF_SBS_SVR), and 0.116 (RF_SBS_PLSR), while the R² values of the PCC_SBS-based models are improved by 0.055 (PCC_SBS_RFR), 0.092 (PCC_SBS_SVR), and 0.128 (PCC_SBS_PLSR). Finally, the two best TNC estimation models are found to be PCC_SBS_PLSR, with an R² of 0.863, and RF_SBS_SVR, with an R² of 0.872. The proposed hybrid feature selection approach not only improves estimation accuracy but also reduces model complexity by choosing the best feature subset.
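
The two-step procedure described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes one plausible reading of step one (rank features by absolute PCC with the target, then keep the shortest top-ranked prefix that gives the best held-out R²), and it swaps in a plain least-squares regressor for RFR, SVR, and PLSR so that the example needs only the Python standard library. All function names (pearson, fit_ols, r2_holdout, hybrid_select) and the synthetic data are hypothetical.

```python
import random
from statistics import fmean


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5 if sxx > 0 and syy > 0 else 0.0


def fit_ols(rows, y, ridge=1e-8):
    """Least-squares coefficients (intercept first) via the normal equations.

    Stand-in regressor (assumption); a tiny ridge term keeps the system solvable.
    """
    X = [[1.0] + list(r) for r in rows]
    p = len(X[0])
    # Augmented matrix [X'X + ridge*I | X'y].
    A = [[sum(r[i] * r[j] for r in X) + (ridge if i == j else 0.0) for j in range(p)]
         + [sum(r[i] * t for r, t in zip(X, y))]
         for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]


def r2_holdout(X_tr, y_tr, X_te, y_te, cols):
    """Fit on the training rows using only `cols`; return R^2 on the held-out rows."""
    def take(X):
        return [[row[c] for c in cols] for row in X]
    beta = fit_ols(take(X_tr), y_tr)
    pred = [beta[0] + sum(b * v for b, v in zip(beta[1:], row)) for row in take(X_te)]
    mean_y = fmean(y_te)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_te, pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_te)
    return 1.0 - ss_res / ss_tot


def hybrid_select(X_tr, y_tr, X_te, y_te):
    """PCC ranking followed by sequential backward selection (illustrative)."""
    n = len(X_tr[0])

    def score(cols):
        return r2_holdout(X_tr, y_tr, X_te, y_te, cols)

    # Step 1: rank by |PCC| with the target, then keep the shortest
    # top-ranked prefix with the highest held-out R^2.
    ranked = sorted(range(n), key=lambda j: -abs(pearson([r[j] for r in X_tr], y_tr)))
    best_k = max(range(1, n + 1), key=lambda k: score(ranked[:k]))
    current = ranked[:best_k]
    best, best_score = list(current), score(current)

    # Step 2: SBS. Drop the feature whose removal hurts least, one at a time,
    # and remember the best subset seen along the way.
    while len(current) > 1:
        candidates = [[c for c in current if c != drop] for drop in current]
        current = max(candidates, key=score)
        s = score(current)
        if s > best_score:
            best, best_score = list(current), s
    return sorted(best), best_score


# Synthetic data (assumption): 8 features; the target depends on features 0 and 3,
# and feature 5 is a noisy copy of feature 0, i.e. a correlated input.
rng = random.Random(0)


def make_data(n):
    X, y = [], []
    for _ in range(n):
        row = [rng.gauss(0, 1) for _ in range(8)]
        row[5] = row[0] + rng.gauss(0, 0.05)
        X.append(row)
        y.append(2.0 * row[0] - 1.5 * row[3] + rng.gauss(0, 0.1))
    return X, y


X_tr, y_tr = make_data(60)
X_te, y_te = make_data(30)
selected, best_r2 = hybrid_select(X_tr, y_tr, X_te, y_te)
print(selected, round(best_r2, 3))
```

For brevity the sketch selects and scores features on a single held-out split, which gives an optimistic R²; cross-validation or a separate test set would be the more careful choice. An RF_SBS variant would only change the ranking in step one, replacing the absolute PCC with random forest feature importances.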

Journal: Biosystems Engineering
Publication Date: Dec 9, 2021
Citations: 4

Similar Papers

Leaf Area Index Estimation Algorithm for GF-5 Hyperspectral Data Based on Different Feature Selection and Machine Learning Methods
Zhulin Chen ... Yuan Sun
Remote Sensing | VOL. 12
01 Jul 2020

A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning
Mahmoud Abdel-Salam ... Shubham Mahajan
Neural Computing and Applications | VOL. 36
16 Aug 2024

Hyperspectral Modeling of Soil Organic Matter Based on Characteristic Wavelength in East China
Mingsong Zhao ... Yuanyuan Lu
Sustainability | VOL. 14
11 Jul 2022

Extension of pQSAR: Ensemble Model Generated by Random Forest and Partial Least Squares Regressions
Byung Chun Kim ... Yongkuk Kim
IEEE Access | VOL. 8
01 Jan 2020
