Most existing studies have focused on identifying the origin of species with protected geographical indications while neglecting to determine the proximate geographical origin of different species. In this study, we investigated the feasibility of using near- and mid-infrared spectroscopy to identify the origin of 156 Polygonatum kingianum samples from six regions in Yunnan, China. In this work, spectral images of different modes reveal more information about the P. kingianum. Comparing the performance of traditional machine learning models according to single spectrum and data fusion, the middle-level data fusion-principal component model has the best performance, and its sensitivity, specificity, and accuracy are all 1, and the model has the least number of variables. The residual convolutional neural network (ResNet) model constructed in the 1050-850cm-1 band confirms that fewer variables are beneficial in improving the accuracy of the model. In conclusion, this study verifies the feasibility of the proposed strategy and establishes a practical model to determine the source of P. kingianum.
Read full abstract