Assessing putative bias in prediction of anti-microbial resistance from real-world genotyping data under explicit causal assumptions

Mattia Prosperi, Christina Boucher, Jiang Bian, Simone Marini

doi:10.1016/j.artmed.2022.102326


Open Access

https://doi.org/10.1016/j.artmed.2022.102326


Journal: Artificial Intelligence in Medicine
Publication Date: Jun 3, 2022
Citations: 1
License type: publisher-specific-oa

Affiliation: University of Florida

Abstract

Whole genome sequencing (WGS) is quickly becoming the customary means for identification of antimicrobial resistance (AMR) due to its ability to obtain high resolution information about the genes and mechanisms that are causing resistance and driving pathogen mobility. By contrast, traditional phenotypic (antibiogram) testing cannot easily elucidate such information. Yet development of AMR prediction tools from genotype-phenotype data can be biased, since sampling is non-randomized. Sample provenience, period of collection, and species representation can confound the association of genetic traits with AMR. Thus, prediction models can perform poorly on new data with sampling distribution shifts. In this work, under an explicit set of causal assumptions, we evaluate the effectiveness of propensity-based rebalancing and confounding adjustment on antibiotic resistance prediction using genotype-phenotype AMR data from the Pathosystems Resource Integration Center (PATRIC). We select bacterial genotypes (encoded as k-mer signatures, i.e., DNA fragments of length k), country, year, species, and AMR phenotypes for the tetracycline drug class, preparing test data with recent genomes coming from a single country. We test boosted logistic regression (BLR) and random forests (RF) with/without bias-handling. On 10,936 instances, we find evidence of species, location, and year imbalance with respect to the AMR phenotype. The crude versus bias-adjusted change in effect of genetic signatures on AMR varies, but only moderately (selecting the top 20,000 out of 40+ million k-mers). The area under the receiver operating characteristic (AUROC) of the RF (0.95) is comparable to that of BLR (0.94) on both out-of-bag samples from bootstrap and the external test (n = 1085), where AUROCs do not decrease. We observe a 1% to 5% gain in AUROC with bias-handling compared to the sole use of genetic signatures.
In conclusion, we recommend using causally-informed prediction methods for modeling real-world AMR data; however, traditional adjustment or propensity-based methods may not provide an advantage in all use cases, and further methodological development should be sought.
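
As an illustration of two ideas named in the abstract, the following Python sketch counts k-mers in a DNA sequence and computes inverse-propensity weights for rebalancing. It is a toy example under stated assumptions, not the authors' pipeline: the function names `kmer_counts` and `stratum_propensity_weights` are hypothetical, the "exposure" is assumed to be the presence of a single k-mer, and the propensity is assumed to be the exposure frequency within each confounder stratum (for instance a species, country, and year combination). The abstract does not say how the propensity was actually modeled.

```python
from collections import Counter, defaultdict


def kmer_counts(sequence, k):
    """Count every overlapping DNA fragment of length k in a sequence."""
    seq = sequence.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def stratum_propensity_weights(exposed, strata):
    """Return one inverse-propensity weight per sample.

    Assumption (not from the paper): the propensity of exposure is
    estimated as the fraction of exposed samples in each stratum.
    """
    totals = defaultdict(int)
    positives = defaultdict(int)
    for e, s in zip(exposed, strata):
        totals[s] += 1
        positives[s] += e

    weights = []
    for e, s in zip(exposed, strata):
        p = positives[s] / totals[s]
        # Probability of the exposure status this sample actually has.
        p_observed = p if e else 1.0 - p
        weights.append(1.0 / p_observed)
    return weights


# Toy data: the k-mer is common in stratum "A" and rare in stratum "B".
exposed = [1, 1, 1, 0, 1, 0, 0, 0]
strata = ["A", "A", "A", "A", "B", "B", "B", "B"]
weights = stratum_propensity_weights(exposed, strata)

# Weighted totals of exposed and unexposed samples within a stratum.
def weighted_total(stratum, status):
    return sum(w for w, e, s in zip(weights, exposed, strata)
               if s == stratum and e == status)

counts = kmer_counts("ACGTACG", 3)
```

In this toy data the weighted totals of exposed and unexposed samples become equal within each stratum, which is the sense in which such weights rebalance a non-randomized sample. In practice, the weights would typically be passed to a learner as per-sample weights, and the propensity would be estimated with a fitted model rather than raw stratum frequencies.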

