Abstract

In near-infrared spectroscopy analysis, the predictive performance of models may degrade due to variations in the measurement environment and sample properties. Maintenance or updates to the model are essential to uphold prediction accuracy. The state-of-the-art approach for model updates involves labeling new samples and incorporating them into the calibration set. However, this strategy leads to an escalating size of the calibration dataset, and the low ratio of new samples renders the model update process inefficient. To enhance update efficiency, a size-stable updating strategy is presented which involves the simultaneous addition of new samples and removal of old samples. This is achieved by utilizing the new set as a basis for identifying and deleting significantly different old labeled samples. To improve labeling efficiency, a batch mode update is employed. Initially, calibration samples are clustered to yield multiple clusters with comparable sizes to the new update set. Subsequently, differences between multiple old sample clusters and the new set are calculated. The old sample cluster that most different are removed, accompanied by the addition of the new sample set to maintain calibration set size stability. In calculating set similarity, a multi-indices fusion method is employed to ensure accurate judgments. The efficacy of this method is validated through simulated and real data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call