PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples.

Changjin Hong,Eduardo Castro-Nallar,Allyson L Byrd,Ying Shen,William Evan Johnson,Solaiappan Manimaran,Joseph F Perez-Rogers,Keith A Crandall

doi:10.1186/2049-2618-2-33

Changjin Hong, Eduardo Castro-Nallar + Show 6 more

Open Access

https://doi.org/10.1186/2049-2618-2-33

Copy DOI

Abstract

BackgroundRecent innovations in sequencing technologies have provided researchers with the ability to rapidly characterize the microbial content of an environmental or clinical sample with unprecedented resolution. These approaches are producing a wealth of information that is providing novel insights into the microbial ecology of the environment and human health. However, these sequencing-based approaches produce large and complex datasets that require efficient and sensitive computational analysis workflows. Many recent tools for analyzing metagenomic-sequencing data have emerged, however, these approaches often suffer from issues of specificity, efficiency, and typically do not include a complete metagenomic analysis framework.ResultsWe present PathoScope 2.0, a complete bioinformatics framework for rapidly and accurately quantifying the proportions of reads from individual microbial strains present in metagenomic sequencing data from environmental or clinical samples. The pipeline performs all necessary computational analysis steps; including reference genome library extraction and indexing, read quality control and alignment, strain identification, and summarization and annotation of results. We rigorously evaluated PathoScope 2.0 using simulated data and data from the 2011 outbreak of Shiga-toxigenic Escherichia coli O104:H4.ConclusionsThe results show that PathoScope 2.0 is a complete, highly sensitive, and efficient approach for metagenomic analysis that outperforms alternative approaches in scope, speed, and accuracy. The PathoScope 2.0 pipeline software is freely available for download at: http://sourceforge.net/projects/pathoscope/.

Highlights

Recent innovations in sequencing technologies have provided researchers with the ability to rapidly characterize the microbial content of an environmental or clinical sample with unprecedented resolution
The user supplies a set of National Center for Biotechnology Information (NCBI) taxonomy identification numbers for organisms to be included in the library (Figure 2)
As PathoLib extracts the reference library, the NCBI GeneInfo number is linked to the taxonomy identification (taxID), and the taxID and organism name are appended to the sequence headers to further link sequences in downstream analyses

Summary

Introduction

Recent innovations in sequencing technologies have provided researchers with the ability to rapidly characterize the microbial content of an environmental or clinical sample with unprecedented resolution. With the steadily increasing number of microbial genomes available in public data repositories, metagenomic characterization using high-throughput sequencing techniques can be used to catalogue microbes co-habituating in human systems [1] and to rapidly identify pathogens responsible for infectious disease outbreaks [2,3,4]. Assemblybased methods [12,13,14,15] have recently gained in popularity due to their increased sensitivity for strain identification These approaches can suffer from issues of specificity, efficiency, and typically do not include a complete metagenomic analysis framework with reference library generation, read quality control, and reporting

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Microbiome	Publication Date: Sep 5, 2014
Citations: 227	License type: cc-by

R Discovery Prime

R Discovery Prime

PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbiome

Lead the way for us

Similar Papers

Comparison of next-generation droplet digital PCR with quantitative PCR for enumeration of Naegleria fowleri in environmental water and clinical samples.
J Xue ... S.P Sherchan
Letters in Applied Microbiology | VOL. 67
J Xue, et. al.J Xue ... S.P Sherchan
01 Oct 2018
Letters in Applied Microbiology | VOL. 67

Design and implementation of a protocol for the detection of Legionella in clinical and environmental samples
Elizabeth J Nazarian ... Kimberlee A Musser
Diagnostic Microbiology and Infectious Disease | VOL. 62
Elizabeth J Nazarian, et. al.Elizabeth J Nazarian ... Kimberlee A Musser
14 Jul 2008
Diagnostic Microbiology and Infectious Disease | VOL. 62

Molecular Epidemiology of Aspergillus fumigatus: an In-Depth Genotypic Analysis of Isolates Involved in an Outbreak of Invasive Aspergillosis
Jesús Guinea ... Teresa Peláez
Journal of Clinical Microbiology | VOL. 49
Jesús Guinea, et. al.Jesús Guinea ... Teresa Peláez
10 Aug 2011
Journal of Clinical Microbiology | VOL. 49

Molecular characterization of hepatitis A virus isolates from environmental and clinical samples in Greece
Petros Kokkinos ... Panos Ziros
Virology Journal | VOL. 7
Petros Kokkinos, et. al.Petros Kokkinos ... Panos Ziros
16 Sep 2010
Virology Journal | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbiome