A deeper understanding of the relationship between the antimicrobial resistance (AMR) gene carriage and phenotype is necessary to develop effective response strategies against this global burden. AMR phenotype is often a result of multi-gene interactions; therefore, we need approaches that go beyond current simple AMR gene identification tools. Machine-learning (ML) methods may meet this challenge and allow the development of rapid computational approaches for AMR phenotype classification. To examine this, we applied multiple ML techniques to 16,950 bacterial genomes across 28 genera, with corresponding MICs for 23 antibiotics with the aim of training models to accurately determine the AMR phenotype from sequenced genomes. This resulted in a >1.5-fold increase in AMR phenotype prediction accuracy over AMR gene identification alone. Furthermore, we revealed 528 unique (often species-specific) genomic routes to antibiotic resistance, including genes not previously linked to the AMR phenotype. Our study demonstrates the utility of ML in predicting AMR phenotypes across diverse clinically relevant organisms and antibiotics. This research proposes a rapid computational method to support laboratory-based identification of the AMR phenotype in pathogens.
Read full abstract