Abstract

The accurate estimation of the spatial and temporal distribution of chlorophyll-a (Chl-a) concentrations in the South China Sea (SCS) is crucial for understanding marine ecosystem dynamics and water quality assessment. However, the challenge of missing values in satellite-derived Chl-a data has hindered obtaining complete spatiotemporal information. Traditional methods for deriving Chl-a are based on the modeling of measured sensor data and in situ measurements. Spatiotemporal imputation of Chl-a is difficult due to the inaccessibility of the measured Chl-a. In this study, we introduce an innovative approach that incorporates an ocean dynamics dataset and utilizes the random forest algorithm for predicting the Chl-a concentration in the SCS. The method combines the spatiotemporal feature pattern of Chl-a and the main influencing factors, and it introduces ocean dynamics data, which has a high correlation with the spatiotemporal distribution of Chl-a, as the input data through feature engineering. Also, we compared Random Forest (RF) with other Machine Learning (ML) methods. The results show that (1) ocean dynamics datasets can provide important data support for Chl-a imputation by capturing the impact of dynamical processes on ecological roles in the South China Sea. (2) The RF method is the superior imputation method for the reconstruction of Chl-a in the South China Sea, with better model performance and smaller errors. This study provides valuable insight for researchers and practitioners in choosing suitable machine learning methods for the imputation of the Chl-a concentration in the SCS, facilitating a better understanding of the region’s marine ecosystems and supporting effective environmental management.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call