Prediction and interpretation of pathogenic bacteria occurrence at a recreational beach using data-driven algorithms

Jiyi Jang,Ather Abbas,Hyein Kim,Chaeyoung Rhee,Seung Gu Shin,Jong Ahn Chun,Sangsoo Baek,Kyung Hwa Cho

doi:10.1016/j.ecoinf.2023.102370

Abstract

Recreational beaches face a threat from pathogenic bacteria that harbor antibiotic resistance genes (ARGs). To predict bacterial occurrence and comprehend their non-linear relationship with hydrometeorological features, advanced machine- and deep-learning algorithms were employed. These algorithms include regression trees (RT), as well as interpretable deep-learning algorithms such as the ‘Input Attention-Long Short-Term Memory (IA-LSTM)’ and ‘Temporal Fusion Transformer (TFT)’. Our focus was on predicting the occurrence of Prevotella, a prevalent pathogenic bacterium found at the beaches. Utilizing model-dependent and model-agnostic interpretation methods, which encompass sensitivity analysis, permutation, and the SHapley Additive exPlanations (SHAP) importance, we evaluated model behavior. RT-based algorithms exhibited predictive capabilities comparable to those of IA-LSTM and TFT, achieving validation Nash–Sutcliffe efficiencies of 0.93, 0.94, and 0.96, respectively. However, the deep-learning algorithms (IA-LSTM and TFT) are surpassed in terms of interpretability. The model-dependent interpretation method identified heavy precipitation as a pivotal hydrometeorological feature linked to increased Prevotella occurrence. Notably, the IA-LSTM identified Prevotella as a potential host for the sulfonamide resistance gene (sul1), suggesting the potential of Prevotella as an indicator for sul1. This research, leveraging interpretable data-driven models, advances our understanding of the hydrometeorological features influencing the occurrence of pathogenic bacteria and the prevalence of ARGs at the beach, and enhances predictive capabilities for bacterial occurrence.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Prediction and interpretation of pathogenic bacteria occurrence at a recreational beach using data-driven algorithms

Abstract

Talk to us

Similar Papers

More From: Ecological Informatics

Lead the way for us

Journal: Ecological Informatics	Publication Date: Nov 7, 2023
Citations: 4

Similar Papers

Prevalence and dissemination of antibiotic resistance genes and coselection of heavy metals in Chinese dairy farms
Bingrui Zhou ... Shaolin Wang
Journal of Hazardous Materials | VOL. 320
Bingrui Zhou, et. al.Bingrui Zhou ... Shaolin Wang
04 Aug 2016
Journal of Hazardous Materials | VOL. 320

Characterization and source identification of antibiotic resistance genes in the sediments of an interconnected river-lake system
Haiyang Chen ... Yanguo Teng
Environment International | VOL. 137
Haiyang Chen, et. al.Haiyang Chen ... Yanguo Teng
03 Feb 2020
Environment International | VOL. 137

Metagenomic insight into the prevalence and driving forces of antibiotic resistance genes in the whole process of three full-scale wastewater treatment plants
Ming Xu ... Jia-Shun Cao
Journal of Environmental Management | VOL. 344
Ming Xu, et. al.Ming Xu ... Jia-Shun Cao
23 Jun 2023
Journal of Environmental Management | VOL. 344

Prevalence of antibiotic resistance genes in wastewater collected from ornamental fish market in northern China.
Xuan Liu ... Hua Wang
Environmental Pollution | VOL. 271
Xuan Liu, et. al.Xuan Liu ... Hua Wang
15 Dec 2020
Environmental Pollution | VOL. 271

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Prediction and interpretation of pathogenic bacteria occurrence at a recreational beach using data-driven algorithms

Abstract

Talk to us

Similar Papers

More From: Ecological Informatics