This study represents the first application of Sentinel-2 remote sensing imagery and model fusion techniques to assess the chlorophyll-a (Chla) concentration and turbidity in Nansi Lake, Shandong Province, China, from 2016 to 2022. First, we innovatively employed the stacking method to fuse eight fundamentally different Machine Learning (ML) models, each utilising 20 and 17 feature bands, resulting in the development of a robust algorithm for estimating the Chla concentration and turbidity in Nansi Lake. The results demonstrate that the Stacking Model has achieved outstanding theoretical generalisation capability. Second, the sensitivity of the model to extreme value data in the sample was quantified, and we found that compared with extreme gradient boosting (XGBoost), the optimal performance of the Stacking Model improved by 12%, to some extent, it solved the problem of high-value underestimation and low-value overestimation. The SHapley Additive exPlanations (SHAP) results revealed that features such as Three Bands, Enhanced Three, Rrs492/Rrs560, Rrs705/Rrs665 play a crucial role in estimating Chla concentration. For the turbidity estimation, the Normalized Difference Turbidity Index (NDTI), Rrs705+Rrs560, Rrs865-Rrs740 made significant contributions. Finally, we utilised the Stacking Model to create spatiotemporal maps of the Chla concentration and turbidity in Nansi Lake from 2016 to 2022. We analysed the causes of the water quality changes and explored the driving factors. Compared with previous studies, this paper provides a new idea for the monitoring of lake water quality parameters by using the high resolution of Sentinel-2 image and the high precision of model fusion technology, these results can provide a reference for similar water area research and decision-making support for environment-related departments.
Read full abstract