Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis

Siyoon Kwon,Donghae Baek,Sung Hyun Jung,Il Won Seo,Hyoseob Noh

doi:10.3390/ijerph18031023

Siyoon Kwon, Donghae Baek + Show 3 more

Open Access

https://doi.org/10.3390/ijerph18031023

Copy DOI

Abstract

To minimize the damage from contaminant accidents in rivers, early identification of the contaminant source is crucial. Thus, in this study, a framework combining Machine Learning (ML) and the Transient Storage zone Model (TSM) was developed to predict the spill location and mass of a contaminant source. The TSM model was employed to simulate non-Fickian Breakthrough Curves (BTCs), which entails relevant information of the contaminant source. Then, the ML models were used to identify the BTC features, characterized by 21 variables, to predict the spill location and mass. The proposed framework was applied to the Gam Creek, South Korea, in which two tracer tests were conducted. In this study, six ML methods were applied for the prediction of spill location and mass, while the most relevant BTC features were selected by Recursive Feature Elimination Cross-Validation (RFECV). Model applications to field data showed that the ensemble Decision tree models, Random Forest (RF) and Xgboost (XGB), were the most efficient and feasible in predicting the contaminant source.

Highlights

When accidental spills of contaminant occur in natural rivers, a rapid response is necessary to minimize the damage to both aquatic life and humans who depend on the river as a water resource
We focused on the optimal Breakthrough Curves (BTCs) features and Machine Learning (ML) models to predict the spill location and spill mass
The RMSE is the square root of Mean Absolute Error (MAE), which has consistent units of target variables

Summary

Introduction

When accidental spills of contaminant occur in natural rivers, a rapid response is necessary to minimize the damage to both aquatic life and humans who depend on the river as a water resource. Zhang and Xin [17] used the basic Genetic Algorithm (GA) to identify the spill location and spill mass of contaminant sources in a small straight river These optimization approaches have limitations of high uncertainties in their deterministic processes and the data used in the optimization [18]. They evaluated the proposed method regarding noise, and validated the model with the data from the real dye tracer test performed in the natural river, which is a significant process to test field applicability. Diffusion process contains many problems of spatial and temporal scale For this reason, data-driven approaches using contaminant spill scenarios to identify the location of the contaminant source were recently presented. The proposed models were applied to the field tracer data obtained in the river in order to ascertain the field applicability

Methodology

CAS Simulation

DT-Based

SVM and Ridge Regression

Feature Importance and Feature Selection

Modeling Performance Criteria

Study Site and Field Tracer Test

Figure

June 2020

Chemical Accident Scenarios in Gam Creek

Model Development

BTC Feature Importance for Inverse Tracking the Contaminant Source

Method

Field Application of ITM

Field Test of Spill Location Predictors

Field of Spill

Field Test of Spill Mass Predictors

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Environmental Research and Public Health	Publication Date: Jan 24, 2021
Citations: 11	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Environmental Research and Public Health

Lead the way for us

Similar Papers

Sensors support machine learning
-
Food Science and Technology | VOL. 33
--
01 Dec 2019
Food Science and Technology | VOL. 33

P125. Development of a novel ensemble machine learning algorithm for prediction of complications and readmission after anterior cervical spinal fusion
Akash A Shah ... Don Y Park
The Spine Journal | VOL. 21
Akash A Shah, et. al.Akash A Shah ... Don Y Park
10 Aug 2021
The Spine Journal | VOL. 21

P126. Development of a novel ensemble machine learning algorithm for prediction of complications and readmission after posterior cervical spinal fusion
Amador Bugarin ... Elizabeth L Lord
The Spine Journal | VOL. 21
Amador Bugarin, et. al.Amador Bugarin ... Elizabeth L Lord
10 Aug 2021
The Spine Journal | VOL. 21

Seismic fragility analysis of steel moment frames using machine learning models
Myoungsu Shin ... Young-Joo Lee
Engineering Applications of Artificial Intelligence | VOL. 126
Myoungsu Shin, et. al.Myoungsu Shin ... Young-Joo Lee
15 Aug 2023
Engineering Applications of Artificial Intelligence | VOL. 126

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Environmental Research and Public Health