Abstract

Near-infrared spectroscopy (NIRS) is a rapid, nondestructive analytical technique utilized in various fields. However, the NIR data, which consists of hundreds of dimensions, may exhibit considerable duplication in the spectrum information. This redundancy might impair modeling effectiveness. As a result, feature selection on the spectral data becomes critical. The Max-Relevance Min-Redundancy (mRMR) method stands out among the different feature selection techniques for dimensional reduction. The approach depends on mutual information (MI) between random variables as the basis for feature selection and is unaffected by modeling methods. However, it is necessary to clarify the benefits of the maximum correlation minimal redundancy algorithm in the context of near-infrared spectral feature selection, as well as its adaptability to various modeling methods. This research focuses on the NIR spectral dataset of maize germination rate, and the mRMR method is utilized to select spectral features. Based on the preceding foundation, we create models for Support Vector Regression, Gaussian Process Regression, Random Forest, and Neural Networks. The experimental findings demonstrate that, among the feature selection methods employed in this paper, the Max-Relevance Min-Redundancy algorithm outperforms others regarding the corn germination rate dataset.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call