Validation strategy of a bioinformatics whole genome sequencing workflow for Shiga toxin-producing Escherichia coli using a reference collection extensively characterized with conventional methods.

Bert Bogaerts,Stéphanie Nouws,Kathleen Marchal,Sarah Denayer,Raf Winand,Bavo Verhaegen,Florence Crombé,Denis Piérard,Qiang Fu,Nancy H C Roosens,Kevin Vanneste,Sigrid C J De Keersmaecker,Julien Van Braekel

doi:10.1099/mgen.0.000531

Abstract

Whole genome sequencing (WGS) enables complete characterization of bacterial pathogenic isolates at single nucleotide resolution, making it the ultimate tool for routine surveillance and outbreak investigation. The lack of standardization, and the variation regarding bioinformatics workflows and parameters, however, complicates interoperability among (inter)national laboratories. We present a validation strategy applied to a bioinformatics workflow for Illumina data that performs complete characterization of Shiga toxin-producing Escherichia coli (STEC) isolates including antimicrobial resistance prediction, virulence gene detection, serotype prediction, plasmid replicon detection and sequence typing. The workflow supports three commonly used bioinformatics approaches for the detection of genes and alleles: alignment with blast+, kmer-based read mapping with KMA, and direct read mapping with SRST2. A collection of 131 STEC isolates collected from food and human sources, extensively characterized with conventional molecular methods, was used as a validation dataset. Using a validation strategy specifically adopted to WGS, we demonstrated high performance with repeatability, reproducibility, accuracy, precision, sensitivity and specificity above 95 % for the majority of all assays. The WGS workflow is publicly available as a ‘push-button’ pipeline at https://galaxy.sciensano.be. Our validation strategy and accompanying reference dataset consisting of both conventional and WGS data can be used for characterizing the performance of various bioinformatics workflows and assays, facilitating interoperability between laboratories with different WGS and bioinformatics set-ups.

Highlights

Whole genome sequencing (WGS) has revolutionized foodborne outbreak investigation and surveillance of a wide variety of microbial pathogens [1]
All other false negatives (FNs) were caused by alleles present in the EnteroBase scheme missing from the PubMLST scheme, even though both databases were assessed at the same time and alleles that were added to EnteroBase more recently than the missing ones from PubMLST were available in both
We present an updated validation framework to extensively validate a bioinformatics workflow (Fig. 1) for the characterization of Shiga toxin-p roducing Escherichia coli (STEC) isolates using WGS data

Summary

Introduction

Whole genome sequencing (WGS) has revolutionized foodborne outbreak investigation and surveillance of a wide variety of microbial pathogens [1]. WGS-b ased methods for relatedness investigation can be scaled up from case-b y-c ase applications to routine surveillance, as illustrated by EnteroBase for cgMLST [6], and SnapperDB for wgSNP [7] analysis Because of these advantages, the use of WGS for pathogen typing in both outbreak situations and routine surveillance is becoming more widespread, with many national reference centres (NRCs, human) and laboratories (NRLs, food and feed) integrating it into their routine activities [1, 8, 9]. The second hurdle, i.e. the need for validation of bioinformatics assays to demonstrate that they are ‘fit-for-purpose’ and adhere to certain predefined quality characteristics, as

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Microbial Genomics	Publication Date: Mar 1, 2021
Citations: 23	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Validation strategy of a bioinformatics whole genome sequencing workflow for Shiga toxin-producing Escherichia coli using a reference collection extensively characterized with conventional methods.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbial Genomics

Lead the way for us

Similar Papers

Molecular characterization and phylogeny of Shiga toxin-producing Escherichia coli derived from cattle farm.
Shiqin Zhang ... Qingli Dong
Frontiers in Microbiology | VOL. 13
Shiqin Zhang, et. al.Shiqin Zhang ... Qingli Dong
04 Aug 2022
Frontiers in Microbiology | VOL. 13

Can Whole Genome and Whole Transcriptome Sequencing Replace Standard Procedures in CLL Diagnostics?
Heiko Mueller ... Claudia Haferlach
Blood | VOL. 142
Heiko Mueller, et. al.Heiko Mueller ... Claudia Haferlach
02 Nov 2023
Blood | VOL. 142

Implementation of Whole Genome Sequencing (WGS) for Identification and Characterization of Shiga Toxin-Producing Escherichia coli (STEC) in the United States.
Rebecca L Lindsey ... Nancy A Strockbine
Frontiers in Microbiology | VOL. 7
Rebecca L Lindsey, et. al.Rebecca L Lindsey ... Nancy A Strockbine
23 May 2016
Frontiers in Microbiology | VOL. 7

Whole Genome Sequencing Characterization of Shiga Toxin–Producing Escherichia coli Isolated from Flour from Swiss Retail Markets
Renate Boss ... Joerg Hummerjohann
Journal of Food Protection | VOL. 82
Renate Boss, et. al.Renate Boss ... Joerg Hummerjohann
01 Aug 2019
Journal of Food Protection | VOL. 82

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Validation strategy of a bioinformatics whole genome sequencing workflow for Shiga toxin-producing Escherichia coli using a reference collection extensively characterized with conventional methods.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbial Genomics