Timely monitoring of potato yield status is crucial for guiding field management and ensuring the sustainability of food trade. In recent years, non-destructive remote sensing techniques have emerged as one of the most promising methods for crop yield monitoring. However, most existing potato yield estimation models have primarily been tested at specific growth stages within a single ecosystem, highlighting the urgent need for a model that can be applied across different ecological environments. In this study, we developed a semi-physical model based on hierarchical linear modeling (HLM) to automatically correct yield estimation based on remotely sensed vegetation indices using environmentally relevant variables such as net solar radiation at the surface (ssr), air temperature at 2 m above the surface (t2m), and soil water content in the root zone (swvl). The accuracy and transferability of the model were further evaluated by comparing it with aboveground destructive sampling methods, conventional empirical models, and machine learning approaches. The results demonstrated that site-specific relationships exist between aboveground agronomic traits parameters like biomass and leaf area index (LAI) with respect to yield estimation. Conventional linear statistical and machine learning methods faced challenges in transferring their constructed yield estimation models across regions and years. However, the HLM approach exhibited excellent generalization ability across all trial samples. In the middle and later stages of potato growth, the standardized LAI determining index proved to be the most effective vegetation for predicting yield, while ssr, sktmax, and shallow swvl were identified as key environmental variables affecting yield. By integrating these variables using HLM modeling techniques, we achieved optimal generalization performance (R2 = 0.57, RMSE = 7.65 t/ha, NRMSE = 25.02 % during tuber growth stage; R2 = 0.60, RMSE = 7.37 t/ha, NRMSE = 24.13 % during starch accumulation stage). This study underscores the importance of considering environmental factors when constructing a reliable model for estimating potato yield at different spatial and temporal scales using remote sensing technology.
Read full abstract