Abstract

Dissolved oxygen (DO) serves as a pivotal indicator, mirroring the intrinsic self-purification capacity of aquatic ecosystems and the overarching quality of the water environment. In the context of the Yangtze Estuary, a crucial hub for biodiversity and economic activities in China, understanding and forecasting levels of DO is instrumental for effective environmental stewardship and management strategies. Considering this, the introduction of sophisticated machine learning algorithms into the monitoring and predictive analytics of dissolved oxygen levels represents an important stride toward leveraging the power of data-driven insights for environmental sustainability. The Yangtze Estuary, characterized by its dynamic and complex hydrological and ecological systems, demands an insightful and nuanced approach to monitoring water quality parameters. To this end, six key monitoring stations were chosen across the estuary, including Xuliujing, Nantong Port, Qidong Port, Qinglong Port, South Port, and North Port, acting as sentinel sites for gauging the health of the water body. Leveraging three cutting-edge modeling techniques-particle swarm optimization-support vector regression (PSO-SVR), artificial neural network (ANN), and random forest (RF)-the research unraveled and forecasted the patterns of dissolved oxygen levels using monthly average water quality data spanning from 2004 to 2020. These models embodied the forefront of machine learning technology, each bringing distinct analytical strengths and perspectives to the table, from the nuanced, non-linear pattern recognition capabilities of ANN to the robustness and interpretability of RF. The meticulous evaluation conducted via the RF model underscored the paramount importance of three water quality variables, namely temperature, five-day biochemical oxygen demand, and ammonia nitrogen, in influencing the spatial-temporal dynamics of dissolved oxygen in the estuary. Comparative analysis of the prediction results yielded by the PSO-SVR, ANN, and RF models illuminated the superior performance of the RF model across the six monitoring stations, with an overall average error margin of 0.19, a testament to its efficacy and reliability. In comparison, the PSO-SVR and ANN models exhibited higher error rates of 0.38 and 0.47, respectively, albeit still contributing valuable insights into the complex dissolved oxygen dynamics in the Yangtze Estuary. The prediction performance of the machine learning models was evaluated, and the overall prediction performance ranking on the training set was RF (R2=0.971; RMSE=0.341 mg·L-1) > PSO-SVR (R2=0.884; RMSE=0.707 mg·L-1) > ANN (R2=0.792; RMSE=0.967 mg·L-1). The overall prediction performance ranking on the test set was RF (R2 = 0.986; RMSE=0.165 mg·L-1) > PSO-SVR (R2=0.951; RMSE=0.332 mg·L-1) > ANN (R2=0.800; RMSE=0.633 mg·L-1). Therefore, the RF model exhibited the best predictive ability on all monitoring sections, showing excellent performance and generalization ability both on the training and the test sets. The PSO-SVR model also performed well on most monitored profiles, with slightly lower predictive performance than that of the RF model though with better stability and generalization ability. However, the ANN model did not perform as perfectly as the other two models in some monitoring profiles and its network structure or parameters may need to be further optimized to improve the prediction accuracy and stability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.