Accurate monitoring of nitrogen nutrition is crucial for improving cotton yield and quality, as well as the ecological environment. The mainstream method for monitoring nutrition is to establish traditional machine learning (ML) models using a single data source. However, this approach has limitations such as limited access to feature information, model-fitting problems, and limited generalization. Deep learning (DL), on the other hand, has shown promise in complex nonlinear modeling tasks due to its flexible structure. However, it has its limitations, such as the fact that agronomic sample collection and testing are usually labor- and material-intensive, resulting in sample sizes that are too small to meet its training conditions. Therefore, there is an urgent need for DL models that can effectively integrate features from multiple data sources and accurately monitor crop nitrogen content, especially in scenarios with small samples. In this study, we conducted indoor pot experiments using the cotton variety Xinluzao 53 and subjected it to six nitrogen treatments. The data sources for our analysis included hyperspectral and digital images of the cotton leaves. To enhance representation learning capabilities, we enriched the multi-class base learners within each layer of the deep forest (DF) model and introduced skip connections. These enhancements improved the quality of inversion for both hyperspectral and digital image datasets. We then developed image-spectral fusion models, which combined the DF structure with stacking ensemble learning. Our focus was on three levels of fusion: feature-level fusion, decision-level fusion, and secondary decision-level fusion. This approach aimed to further enhance the accuracy and stability of nitrogen content inversion. The DF model satisfied the training condition for small samples. Compared to traditional ML algorithms and the original DF algorithm, the improved DF model achieved an increase in validation set R2 of 13.4–28.5% and 10.9–14.9%, respectively. These findings highlight the enhanced accuracy and stability of the improved DF model. Additionally, compared to the optimal inversion model using two single data sources, the “Image-Spectral” three-level fusion models exhibited improvements in validation set R2 of 8.6–9.3%, 10.5–11.2%, and 11.8–12.5% for feature-level, decision-level, and secondary decision-level fusion, respectively. The improved DF and three-level fusion model collectively contributed to the increased accuracy of cotton nitrogen content inversion. Among these models, the secondary decision-level fusion model demonstrated the most marked improvement. This methodology provides valuable insights into monitoring crop phenotypic parameters in situations with limited sample sizes.
Read full abstract