Abstract

Rice is one of the three major crops in the world. The hyperspectral identification of rice seeds is of great significance to the development of agriculture and hence has made remarkable achievements. However, the lack of publicly available benchmark datasets with a large number of rice seeds hinders the generalization of models and limits the advancement of the field. To address this issue, this study proposes a publicly accessible dataset for rice seed classification by hyperspectral imaging system. The proposed dataset contains 6 categories of nearly 10,000 seeds each and is characterized by (1) a large scale on the number of seeds per category and the total seed number; (2) a greater adherence to real-world scenarios; and (3) high intra-class diversity and high inter-class similarity. Based on the proposed dataset, an Instance Difficulty-weighted K-Nearest Neighbor algorithm (IDKNN) is further proposed. It is a credible algorithm, emphasizing individual seed accuracy rather than overall performance. Additionally, we explore the effects of different Regions Of Interest (ROI). Experimental results constitute the baseline performance of the proposed dataset. Compared with 10 representative algorithms, the IDKNN algorithm achieves the best performance, especially on the ROI of the entire seed. The dataset can be downloaded from https://github.com/fliery/RSHI60T.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call