Abstract
BackgroundStreptococcus suis is divided into 29 serotypes based on a serological reaction against the capsular polysaccharide (CPS). Multiplex PCR tests targeting the cps locus are also used to determine S. suis serotypes, but they cannot differentiate between serotypes 1 and 14, and between serotypes 2 and 1/2. Here, we developed a pipeline permitting in silico serotype determination from whole-genome sequencing (WGS) short-read data that can readily identify all 29 S. suis serotypes.ResultsWe sequenced the genomes of 121 strains representing all 29 known S. suis serotypes. We next combined available software into an automated pipeline permitting in silico serotyping of strains by differential alignment of short-read sequencing data to a custom S. suis cps loci database. Strains of serotype pairs 1 and 14, and 2 and 1/2 could be differentiated by a missense mutation in the cpsK gene. We report a 99 % match between coagglutination- and pipeline-determined serotypes for strains in our collection. We used 375 additional S. suis genomes downloaded from the NCBI’s Sequence Read Archive (SRA) to validate the pipeline. Validation with SRA WGS data resulted in a 92 % match. Included pipeline subroutines permitted us to assess strain virulence marker content and obtain multilocus sequence typing directly from WGS data.ConclusionsOur pipeline permits rapid and accurate determination of S. suis serotype, and other lineage information, directly from WGS data. By discriminating between serotypes 1 and 14, and between serotypes 2 and 1/2, our approach solves a three-decade longstanding S. suis typing issue.Electronic supplementary materialThe online version of this article (doi:10.1186/s12866-016-0782-8) contains supplementary material, which is available to authorized users.
Highlights
Streptococcus suis is divided into 29 serotypes based on a serological reaction against the capsular polysaccharide (CPS)
S. suis strains are typed based on a serological reaction against the capsular polysaccharide (CPS), whose biosynthesis machinery is encoded by genes located in the cps locus [3, 4]
We show that whole-genome sequencing (WGS) short-read data, which can be generated rapidly and relatively inexpensively for large numbers of strains, can be used to precisely identify the serotype of S. suis strains, as well as to obtain other relevant strain information in an automated fashion
Summary
Streptococcus suis is divided into 29 serotypes based on a serological reaction against the capsular polysaccharide (CPS). We developed a pipeline permitting in silico serotype determination from whole-genome sequencing (WGS) short-read data that can readily identify all 29 S. suis serotypes. S. suis strains are typed based on a serological reaction against the capsular polysaccharide (CPS), whose biosynthesis machinery is encoded by genes located in the cps locus [3, 4]. We report the development of a user-friendly bioinformatics pipeline running on the Linux operating system that combines available free software, and which can be used for rapid determination of S. suis serotype directly from the short-read WGS data. The pipeline uses the short-read WGS data to determine strain multilocus sequence typing (MLST) sequence type (ST) and virulence factor content
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have