The mislabelled Khao Dawk Mali 105 rice coming from other geographical region outside the Thung Kula Rong Hai region is extremely profitable and difficult to detect; to prevent retail fraud (that adversely affects both the food industry and consumers), it is vital to identify geographical origin. Near infrared spectroscopy can be used to detect the specific content of organic moieties in agricultural and food products. The present study implemented the combinatorial method of FT-NIR spectroscopy with chemometrics to identify geographical origin of Khao Dawk Mali 105 rice. Rice samples were collected from 2 different region including the north and northeast of Thailand. NIR spectra data were collected in range of 12,500 – 4,000 cm−1 (800–2,500 nm). Five machine learning algorithms including linear discriminant analysis (LDA), partial least squares discriminant analysis (PLS-DA), C-support vector classification (C-SVC), backpropagation neural networks (BPNN), hybrid principal component analysis-neural network (PC-NN) and K-nearest neighbors (KNN) were employed to classify NIR data of rice samples with full wavelength and selected wavelength by Extremely Randomized Trees (Extra trees) algorithm. Based on the findings, geographical origin of rice could be specified quickly, cheaply, and reliably using combination of NIRS and machine learning. All models creating by full wavelength and selected wavelength exhibited accuracy between 65 and 100 % for identifying geographical region of rice. It was proven that NIR spectroscopy may be used for the quick and non-destructive identification of geographical origin of Khao Dawk Mali 105 rice.
Read full abstract