Abstract
As WGS is increasingly used by food industry to characterize pathogen isolates, users are challenged by the variety of analysis approaches available, ranging from methods that require extensive bioinformatics expertise to commercial software packages. This study aimed to assess the impact of analysis pipelines (i.e., different hqSNP pipelines, a cg/wgMLST pipeline) and the reference genome selection on analysis results (i.e., hqSNP and allelic differences as well as tree topologies) and conclusion drawn. For these comparisons, whole genome sequences were obtained for 40 Listeria monocytogenes isolates collected over 18 years from a cold-smoked salmon facility and 2 other isolates obtained from different facilities as part of academic research activities; WGS data were analyzed with three hqSNP pipelines and two MLST pipelines. After initial clustering using a k-mer based approach, hqSNP pipelines were run using two types of reference genomes: (i) closely related closed genomes (“closed references”) and (ii) high-quality de novo assemblies of the dataset isolates (“draft references”). All hqSNP pipelines identified similar hqSNP difference ranges among isolates in a given cluster; use of different reference genomes showed minimal impacts on hqSNP differences identified between isolate pairs. Allelic differences obtained by wgMLST showed similar ranges as hqSNP differences among isolates in a given cluster; cgMLST consistently showed fewer differences than wgMLST. However, phylogenetic trees and dendrograms, obtained based on hqSNP and cg/wgMLST data, did show some incongruences, typically linked to clades supported by low bootstrap values in the trees. When a hqSNP cutoff was used to classify isolates as “related” or “unrelated,” use of different pipelines yielded a considerable number of discordances; this finding supports that cut-off values are valuable to provide a starting point for an investigation, but supporting and epidemiological evidence should be used to interpret WGS data. Overall, our data suggest that cgMLST-based data analyses provide for appropriate subtype differentiation and can be used without the need for preliminary data analyses (e.g., k-mer based clustering) or external closed reference genomes, simplifying data analyses needs. hqSNP or wgMLST analyses can be performed on the isolate clusters identified by cgMLST to increase the precision on determining the genomic similarity between isolates.
Highlights
Listeria monocytogenes is a major problem for the food industry and food business operators, including small manufacturers and retailers, as this organism is ubiquitous in nature and is highly adapted to conditions usually used for food preservation and safety (Buchanan et al, 2017)
Differences not present in the core genome are not assessed. Another genome-wide multi locus sequence typing (MLST) scheme being used by the US Centers for Disease Control and Prevention (CDC) and PulseNet International is known as the whole genome MLST pipeline provided through BioNumerics (Jackson et al, 2016; Nadon et al, 2017)
Phylogenetic trees were built from high quality single nucleotide polymorphisms (hqSNPs) matrices and dendrograms from cg/whole genome MLST (wgMLST) allelic differences
Summary
Listeria monocytogenes is a major problem for the food industry and food business operators, including small manufacturers and retailers, as this organism is ubiquitous in nature and is highly adapted to conditions usually used for food preservation and safety (Buchanan et al, 2017). Persistence of a L. monocytogenes strain in a food facility can often be traced back to unhygienic design of equipment, infra-structure problems (e.g., hollow framework, cracks on floor) or inefficient cleaning and sanitation procedures. In addition to identifying persistent strains, the food industry has a need to investigate the relatedness of organisms involved in a single contamination event to aid accurate root cause analysis. Findings of such an analysis can optimize the implementation of appropriate corrective actions to prevent recurrence of contamination issues
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.