Black beans from different geographical origins exhibit variations in terms of their nutritional and economic values. Therefore, ensuring traceability of the geographical origin of black beans is essential for both consumers and the product processing industries. The study designed a comprehensive black bean origin identification model using near-infrared (NIR) spectroscopy combined with discriminant analysis algorithms. To enhance the performance of the model, this study incorporated fuzzy logic into regularized complete linear discriminant analysis (RCLDA) and proposed the fuzzy regularized complete linear discriminant analysis (FRCLDA). This innovative approach aims to extract additional information from overlapping NIR spectra of black beans. NIR spectra of black beans from five locations were collected employing a portable near-infrared spectrometer. In the preprocessing stage, the Savitzky–Golay (SG) and mean centering (MC) were applied to process the raw spectra. Subsequently, complete linear discriminant analysis (CLDA), RCLDA, and FRCLDA were utilized to extract the information from the processed data. Finally, k-nearest neighbor (KNN), support vector machines (SVM), and extreme learning machine (ELM) were employed to classify the black bean samples based on the extracted features. The results indicated that KNN achieved the best classification performance. The FRCLDA-KNN model achieved the exceptional performance with a classification accuracy of 99.23% on the test set. Therefore, the system demonstrates satisfactory accuracy and performance in accurately identifying black beans from different geographical origins.
Read full abstract