Transferability of Machine Learning Models for Geogenic Contaminated Groundwaters.

Hailong Cao, Wenjing Liu, Xianjun Xie, Ziyi Xiao

doi:10.1021/acs.est.4c01327

Abstract

Machine learning models show promise in identifying geogenic contaminated groundwaters. Modeling in regions with few or no samples is challenging because such models need large training sets. One potential solution is to transfer existing models to such regions. This study explores the transferability of high-fluoride groundwater models between basins in the Shanxi Rift System, considering six factors: modeling method, predictor type, data size, sample/predictor ratio (SPR), predictor range, and data informing. Results show that transferability is achieved only when model predictors are based on hydrochemical parameters rather than surface parameters. Data informing, i.e., adding samples from challenging regions to the training set, further enhances transferability. Stepwise regression shows that hydrochemical predictors and data informing significantly improve transferability, while data size, SPR, and predictor range have no significant effect. Additionally, despite their stronger nonlinear capabilities, random forests and artificial neural networks do not necessarily surpass logistic regression in transferability. Lastly, we use the t-SNE algorithm to generate low-dimensional representations of data from different basins and compare these representations to elucidate the critical role of predictor type in transferability.
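The transfer and data-informing setup described in the abstract can be illustrated with a minimal sketch. It does not reproduce the study's data, models, or results. Everything in it is an assumption made for illustration: the two basins are synthetic, the two predictors are hypothetical standardized hydrochemical variables, the target basin differs from the source only by an assumed intercept shift, and a plain-Python logistic regression fitted by gradient descent stands in for the study's models. The names `make_basin`, `fit_logistic`, and `accuracy` are hypothetical helpers.

```python
import math
import random


def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def make_basin(n, mean, intercept, seed):
    """Synthetic basin with two hypothetical standardized hydrochemical predictors."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = [rng.gauss(mean, 1.0), rng.gauss(mean, 1.0)]
        # Assumed ground truth: same slopes in every basin, basin-specific intercept.
        p_high = sigmoid(2.0 * x[0] - 2.0 * x[1] + intercept)
        data.append((x, 1 if rng.random() < p_high else 0))
    return data


def fit_logistic(data, lr=0.1, epochs=500):
    """Full-batch gradient descent on the mean log-loss."""
    w, b = [0.0, 0.0], 0.0
    n = len(data)
    for _ in range(epochs):
        gw, gb = [0.0, 0.0], 0.0
        for x, y in data:
            err = sigmoid(w[0] * x[0] + w[1] * x[1] + b) - y
            gw[0] += err * x[0]
            gw[1] += err * x[1]
            gb += err
        w = [w[0] - lr * gw[0] / n, w[1] - lr * gw[1] / n]
        b -= lr * gb / n
    return w, b


def accuracy(model, data):
    """Fraction of samples whose predicted class (threshold 0.5) matches the label."""
    w, b = model
    hits = sum(
        (sigmoid(w[0] * x[0] + w[1] * x[1] + b) >= 0.5) == (y == 1)
        for x, y in data
    )
    return hits / len(data)


# Source basin: well sampled. Target basin: treated as sparsely sampled.
source = make_basin(200, mean=0.0, intercept=0.0, seed=1)
target = make_basin(300, mean=0.5, intercept=-1.0, seed=2)

# Data informing: a small part of the target basin joins the training set;
# the rest is held out so both models are scored on the same unseen samples.
inform, holdout = target[:50], target[50:]

acc_transfer = accuracy(fit_logistic(source), holdout)
informed_train = source + inform
acc_informed = accuracy(fit_logistic(informed_train), holdout)

print(f"transfer only:       {acc_transfer:.2f}")
print(f"with data informing: {acc_informed:.2f}")
```

Both models are scored on the same held-out target samples, so the two printed accuracies are directly comparable. Whether data informing helps in this toy setup depends on the assumed shift between basins, so the sketch says nothing about the size of the effect reported in the study.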

More From: Environmental science & technology
