Abstract

Core genome multilocus sequence typing (cgMLST) has gained popularity in recent years in epidemiological research and subspecies-level classification. cgMLST retains the intuitive nature of traditional MLST but offers much greater resolution by utilizing significantly larger portions of the genome. Here, we introduce a cgMLST scheme for Vibrio cholerae, a bacterium abundant in marine and freshwater environments and the etiologic agent of cholera. A set of 2,443 core genes ubiquitous in V. cholerae were used to analyze a comprehensive data set of 1,262 clinical and environmental strains collected from 52 countries, including 65 newly sequenced genomes in this study. We established a sublineage threshold based on 133 allelic differences that creates clusters nearly identical to traditional MLST types, providing backwards compatibility to new cgMLST classifications. We also defined an outbreak threshold based on seven allelic differences that is capable of identifying strains from the same outbreak and closely related isolates that could give clues on outbreak origin. Using cgMLST, we confirmed the South Asian origin of modern epidemics and identified clustering affinity among sublineages of environmental isolates from the same geographic origin. Advantages of this method are highlighted by direct comparison with existing classification methods, such as MLST and single-nucleotide polymorphism-based methods. cgMLST outperforms all existing methods in terms of resolution, standardization, and ease of use. We anticipate this scheme will serve as a basis for a universally applicable and standardized classification system for V. cholerae research and epidemiological surveillance in the future. This cgMLST scheme is publicly available on PubMLST (https://pubmlst.org/vcholerae/).IMPORTANCE Toxigenic Vibrio cholerae isolates of the O1 and O139 serogroups are the causative agents of cholera, an acute diarrheal disease that plagued the world for centuries, if not millennia. Here, we introduce a core genome multilocus sequence typing scheme for V. cholerae Using this scheme, we have standardized the definition for subspecies-level classification, facilitating global collaboration in the surveillance of V. cholerae In addition, this typing scheme allows for quick identification of outbreak-related isolates that can guide subsequent analyses, serving as an important first step in epidemiological research. This scheme is also easily scalable to analyze thousands of isolates at various levels of resolution, making it an invaluable tool for large-scale ecological and evolutionary analyses.

Highlights

  • Cholera is transmitted in a fecal-oral route mostly by contaminated food or water [1, 2]

  • We identified a total of 1,026 core genome sequence types (cgSTs) from 1,262 genomes collected from 52 180 countries

  • 12 sequence type (ST) are exclusively present in the 7th pandemic El 186 Tor lineage identified using traditional multilocus sequence typing (MLST) [20, 21], whereas 560 cgSTs are uniquely present 187 in this group based on Core genome multilocus sequence typing (cgMLST) (Table S1)

Read more

Summary

INTRODUCTION

Cholera is transmitted in a fecal-oral route mostly by contaminated food or water [1, 2]. This increase in efficiency, standardizability, and resolution compared to current methods make cgMLST the most suitable classification scheme for large scale V. cholerae surveillance By applying this scheme to our collection of over 1,200 isolates collected around the world, it was possible to establish outbreak and sublineage thresholds which allowed us to validate the South Asian origin of many modern epidemics as proposed in previous studies [5, 57, 58] and identify clustering affinity among environmental strains, where isolates from the same sublineage are likely from the same geographic region. This pattern is not seen in clinical isolates, as human hosts readily carry them over large geographical distances

173 RESULTS AND DISCUSSION
458 Conclusion
Introduction
FIGURES AND FIGURE LEGENDS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call