MentaLiST - A fast MLST caller for large MLST schemes.

Pedro Feijao,Hua-Ting Yao,Cedric Chauve,Jennifer Gardy,William Hsiao,Dan Fornika,Leonid Chindelevitch

doi:10.1099/mgen.0.000146

Abstract

MLST (multi-locus sequence typing) is a classic technique for genotyping bacteria, widely applied for pathogen outbreak surveillance. Traditionally, MLST is based on identifying sequence types from a small number of housekeeping genes. With the increasing availability of whole-genome sequencing data, MLST methods have evolved towards larger typing schemes, based on a few hundred genes [core genome MLST (cgMLST)] to a few thousand genes [whole genome MLST (wgMLST)]. Such large-scale MLST schemes have been shown to provide a finer resolution and are increasingly used in various contexts such as hospital outbreaks or foodborne pathogen outbreaks. This methodological shift raises new computational challenges, especially given the large size of the schemes involved. Very few available MLST callers are currently capable of dealing with large MLST schemes. We introduce MentaLiST, a new MLST caller, based on a k-mer voting algorithm and written in the Julia language, specifically designed and implemented to handle large typing schemes. We test it on real and simulated data to show that MentaLiST is faster than any other available MLST caller while providing the same or better accuracy, and is capable of dealing with MLST schemes with up to thousands of genes while requiring limited computational resources. MentaLiST source code and easy installation instructions using a Conda package are available at https://github.com/WGS-TB/MentaLiST.

Highlights

Since it was introduced by Maiden et al in 1998 [1], multilocus sequence typing (MLST) has become a fundamental technique for classifying bacterial isolates into strains
In the specific case of MLST, this has led to the emergence of MLST schemes based on a larger set of genes, such as core genome MLST, that consider the set of core genes shared by a group of related strains, and even whole genome MLST
As expected for a traditional MLST scheme, all tested methods made identical calls on all 41 samples, except for SRST2, where on two samples the call for gene ddl was different from the other callers, 11 versus 5 on both cases, and had the flags ‘*?’ indicating mismatches and uncertainty due to a low depth of coverage in certain parts of the gene, according to SRST2 documentation

Summary

Introduction

Since it was introduced by Maiden et al in 1998 [1], multilocus sequence typing (MLST) has become a fundamental technique for classifying bacterial isolates into strains It has been applied in a large number of contexts, especially related to pathogen outbreak surveillance [2]. Jolley et al showed that traditional MLST schemes were not able to discriminate separate sublineages within a clonal complex of Neisseria meningitidis [4] This observation has come at a time when advances in sequencing technologies and protocols have had a major impact on public health, as it is common to rapidly obtain WGS data from a pathogen outbreak, allowing for monitoring at an unprecedented level of resolution [5,6,7,8,9,10,11,12,13]. In the specific case of MLST, this has led to the emergence of MLST schemes based on a larger set of genes, such as core genome MLST (cgMLST), that consider the set of core genes shared by a group of related strains (generally a few hundred genes), and even whole genome MLST (wgMLST)

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Microbial Genomics	Publication Date: Jan 10, 2018
Citations: 45	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

MentaLiST - A fast MLST caller for large MLST schemes.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbial Genomics

Lead the way for us

Similar Papers

Core Genome Multilocus Sequence Typing for Identification of Globally Distributed Clonal Groups and Differentiation of Outbreak Strains of Listeria monocytogenes.
Yi Chen ... Errol A Strain
Applied and Environmental Microbiology | VOL. 82
Yi Chen, et. al.Yi Chen ... Errol A Strain
12 Aug 2016
Applied and Environmental Microbiology | VOL. 82

1932. Molecular Epidemiology of NDM-producing Acinetobacter baumannii in the US—October 2013—March 2022
...
Open Forum Infectious Diseases | VOL. 10
, et. al. ...
27 Nov 2023
1932. Molecular Epidemiology of NDM-producing Acinetobacter baumannii in the US—October 2013—March 2022
...

Multilocus sequence typing schemes for the emerging swine pathogen Mycoplasma hyosynoviae
Moritz Bünger ... Joachim Spergser
Veterinary Microbiology | VOL. 290
Moritz Bünger, et. al.Moritz Bünger ... Joachim Spergser
15 Jan 2024
Veterinary Microbiology | VOL. 290

Development of a multilocus sequence typing scheme for Streptococcus gallolyticus
Yusuke Shibata ... Ryohei Nomoto
Microbiology | VOL. 160
Yusuke Shibata, et. al.Yusuke Shibata ... Ryohei Nomoto
16 Oct 2013
Microbiology | VOL. 160

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MentaLiST - A fast MLST caller for large MLST schemes.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbial Genomics