Visible and near-infrared (Vis-NIR) spectroscopy has been widely applied in many fields for the qualitative and quantitative analysis. Chemometric techniques including pre-processing, variable selection, and multivariate calibration models play an important role to better extract useful information from spectral data. In this study, a new de-noising method (lifting wavelet transform, LWT), four variable selection methods, as well as two non-linear machine learning models were simultaneously analyzed to compare the impact of chemometric approaches on wood density determination among various tree species and geographical locations. In addition, fruit fly optimization algorithm (FOA) and response surface methodology (RSM) were employed to optimize the parameters of generalized regression neural network (GRNN) and particle swarm optimization-support vector machine (PSO-SVM), respectively. As for various chemometric methods, the optimal chemometric method was different for the same tree species collected from different locations. FOA-GRNN model combined with LWT and CARS deliver the best performance for Chinese white poplar of Heilongjiang province. In contrast, PLS model showed a good performance for Chinese white poplar collected from Jilin province based on raw spectra. However, for other tree species, RSM-PSO-SVM models can improve the performance of wood density prediction compared to traditional linear and FOA-GRNN models. Especially for Acer mono Maxim, when compared to linear models, the coefficient of determination of prediction set ( ) and relative prediction deviation (RPD) were increased by 47.70% and 44.48%, respectively. And the dimensionality of Vis-NIR spectral data was decreased from 2048 to 20. Therefore, the appropriate chemometric technique should be selected before building calibration models.
Read full abstract