Permanent Draft Research Articles

BackgroundOur understanding of the composition, function, and health implications of human microbiota has been advanced by high-throughput sequencing and the development of new genomic analyses. However, trade-offs among alternative strategies for the acquisition and analysis of sequence data remain understudied.MethodsWe assessed eight popular taxonomic profiling pipelines; MetaPhlAn2, metaMix, PathoScope 2.0, Sigma, Kraken, ConStrains, Centrifuge and Taxator-tk, against a battery of metagenomic datasets simulated from real data. The metagenomic datasets were modeled on 426 complete or permanent draft genomes stored in the Human Oral Microbiome Database and were designed to simulate various experimental conditions, both in the design of a putative experiment; read length (75–1,000 bp reads), sequence depth (100K–10M), and in metagenomic composition; number of species present (10, 100, 426), species distribution. The sensitivity and specificity of each of the pipelines under various scenarios were measured. We also estimated the relative root mean square error and average relative error to assess the abundance estimates produced by different methods. Additional datasets were generated for five of the pipelines to simulate the presence within a metagenome of an unreferenced species, closely related to other referenced species. Additional datasets were also generated in order to measure computational time on datasets of ever-increasing sequencing depth (up to 6 × 107).ResultsTesting of eight pipelines against 144 simulated metagenomic datasets initially produced 1,104 discrete results. Pipelines using a marker gene strategy; MetaPhlAn2 and ConStrains, were overall less sensitive, than other pipelines; with the notable exception of Taxator-tk. This difference in sensitivity was largely made up in terms of runtime, significantly lower than more sensitive pipelines that rely on whole-genome alignments such as PathoScope2.0. However, pipelines that used strategies to speed-up alignment between genomic references and metagenomic reads, such as kmerization, were able to combine both high sensitivity and low run time, as is the case with Kraken and Centrifuge. Absent species genomes in the database mostly led to assignment of reads to the most closely related species available in all pipelines. Our results therefore suggest that taxonomic profilers that use kmerization have largely superseded those that use gene markers, coupling low run times with high sensitivity and specificity. Taxonomic profilers using more time-consuming read reassignment, such as PathoScope 2.0, provided the most sensitive profiles under common metagenomic sequencing scenarios. All the results described and discussed in this paper can be visualized using the dedicated R Shiny application (https://github.com/microgenomics/HumanMicrobiomeAnalysis). All of our datasets, pipelines and results are made available through the GitHub repository for future benchmarking.

Read full abstract

The permanent draft genome sequence of Actinotignum schaalii DSM 15541T is presented. The annotated genome includes 2,130,987 bp, with 1777 protein-coding and 58 rRNA-coding genes. Genome sequence analysis revealed absence of genes encoding for: components of the PTS systems, enzymes of the TCA cycle, glyoxylate shunt and gluconeogensis. Genomic data revealed that A. schaalii is able to oxidize carbohydrates via glycolysis, the nonoxidative pentose phosphate and the Entner-Doudoroff pathways. Besides, the genome harbors genes encoding for enzymes involved in the conversion of pyruvate to lactate, acetate and ethanol, which are found to be the end products of carbohydrate fermentation. The genome contained the gene encoding Type I fatty acid synthase required for de novo FAS biosynthesis. The plsY and plsX genes encoding the acyltransferases necessary for phosphatidic acid biosynthesis were absent from the genome. The genome harbors genes encoding enzymes responsible for isoprene biosynthesis via the mevalonate (MVA) pathway. Genes encoding enzymes that confer resistance to reactive oxygen species (ROS) were identified. In addition, A. schaalii harbors genes that protect the genome against viral infections. These include restriction-modification (RM) systems, type II toxin-antitoxin (TA), CRISPR-Cas and abortive infection system. A. schaalii genome also encodes several virulence factors that contribute to adhesion and internalization of this pathogen such as the tad genes encoding proteins required for pili assembly, the nanI gene encoding exo-alpha-sialidase, genes encoding heat shock proteins and genes encoding type VII secretion system. These features are consistent with anaerobic and pathogenic lifestyles. Finally, resistance to ciprofloxacin occurs by mutation in chromosomal genes that encode the subunits of DNA-gyrase (GyrA) and topisomerase IV (ParC) enzymes, while resistant to metronidazole was due to the frxA gene, which encodes NADPH-flavin oxidoreductase.

Read full abstract

Permanent Draft Research Articles

Related Topics

Articles published on Permanent Draft

Permanent draft genome sequences of cadmium-resistant isolates of Cupriavidus from soils within the Tar Creek Superfund site.

Permanent draft genome sequence of Bradyrhizobium vignae, strain ISRA 400, an elite nitrogen-fixing bacterium, isolated from the groundnut growing area in Senegal.

Evaluation of computational methods for human microbiome analysis using simulated data.

Genome sequence of Epibacterium ulvae strain DSM 24752T, an indigoidine-producing, macroalga-associated member of the marine Roseobacter group

Draft Genome of Burkholderia cenocepacia TAtl-371, a Strain from the Burkholderia cepacia Complex Retains Antagonism in Different Carbon and Nitrogen Sources.

Draft genome sequence of Actinotignum schaalii DSM 15541T: Genetic insights into the lifestyle, cell fitness and virulence.

Permanent Draft Genome Sequence of Rhizobium sp. Strain LCM 4573, a Salt-Tolerant, Nitrogen-Fixing Bacterium Isolated from Senegalese Soils.

Permanent Draft Genome Sequence of Desulfurococcus amylolyticus Strain Z-533T, a Peptide and Starch Degrader Isolated from Thermal Springs in the Kamchatka Peninsula and Kunashir Island, Russia.

Permanent Draft Genome Sequence of the French Bean Symbiont Rhizobium sp. Strain RSm-3 Isolated from the Eastern Himalayan Region of India.

Permanent Draft Genome Sequences of Three Frankia sp. Strains That Are Atypical, Noninfective, Ineffective Isolates.

High-quality permanent draft genome sequence of Bradyrhizobium sp. strain WSM1743 - an effective microsymbiont of an Indigofera sp. growing in Australia.

High-quality permanent draft genome sequence of the Mimosa asperata - nodulating Cupriavidus sp. strain AMP6.

High-quality permanent draft genome sequence of the Lebeckia ambigua-nodulating Burkholderia sp. strain WSM4176.

High-quality permanent draft genome sequence of the Lebeckia - nodulating Burkholderia dilworthii strain WSM3556(T).

Permanent draft genome of ‘Rhodopirellula islandica’ strain K833

High-quality permanent draft genome sequence of Rhizobium sullae strain WSM1592; a Hedysarum coronarium microsymbiont from Sassari, Italy.

High-quality permanent draft genome sequence of Ensifer meliloti strain 4H41, an effective salt- and drought-tolerant microsymbiont of Phaseolus vulgaris.

High-quality permanent draft genome sequence of Bradyrhizobium sp. Ai1a-2; a microsymbiont of Andira inermis discovered in Costa Rica

High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Burkholderia sp. strain UYPR1.413.

High-quality permanent draft genome sequence of Bradyrhizobium sp. Tv2a.2, a microsymbiont of Tachigali versicolor discovered in Barro Colorado Island of Panama.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Permanent Draft Research Articles

Related Topics

Articles published on Permanent Draft

Permanent draft genome sequences of cadmium-resistant isolates of Cupriavidus from soils within the Tar Creek Superfund site.

Permanent draft genome sequence of Bradyrhizobium vignae, strain ISRA 400, an elite nitrogen-fixing bacterium, isolated from the groundnut growing area in Senegal.

Evaluation of computational methods for human microbiome analysis using simulated data.

Genome sequence of Epibacterium ulvae strain DSM 24752T, an indigoidine-producing, macroalga-associated member of the marine Roseobacter group

Draft Genome of Burkholderia cenocepacia TAtl-371, a Strain from the Burkholderia cepacia Complex Retains Antagonism in Different Carbon and Nitrogen Sources.

Draft genome sequence of Actinotignum schaalii DSM 15541T: Genetic insights into the lifestyle, cell fitness and virulence.

Permanent Draft Genome Sequence of Rhizobium sp. Strain LCM 4573, a Salt-Tolerant, Nitrogen-Fixing Bacterium Isolated from Senegalese Soils.

Permanent Draft Genome Sequence of Desulfurococcus amylolyticus Strain Z-533T, a Peptide and Starch Degrader Isolated from Thermal Springs in the Kamchatka Peninsula and Kunashir Island, Russia.

Permanent Draft Genome Sequence of the French Bean Symbiont Rhizobium sp. Strain RSm-3 Isolated from the Eastern Himalayan Region of India.

Permanent Draft Genome Sequences of Three Frankia sp. Strains That Are Atypical, Noninfective, Ineffective Isolates.

High-quality permanent draft genome sequence of Bradyrhizobium sp. strain WSM1743 - an effective microsymbiont of an Indigofera sp. growing in Australia.

High-quality permanent draft genome sequence of the Mimosa asperata - nodulating Cupriavidus sp. strain AMP6.

High-quality permanent draft genome sequence of the Lebeckia ambigua-nodulating Burkholderia sp. strain WSM4176.

High-quality permanent draft genome sequence of the Lebeckia - nodulating Burkholderia dilworthii strain WSM3556(T).

Permanent draft genome of ‘Rhodopirellula islandica’ strain K833

High-quality permanent draft genome sequence of Rhizobium sullae strain WSM1592; a Hedysarum coronarium microsymbiont from Sassari, Italy.

High-quality permanent draft genome sequence of Ensifer meliloti strain 4H41, an effective salt- and drought-tolerant microsymbiont of Phaseolus vulgaris.

High-quality permanent draft genome sequence of Bradyrhizobium sp. Ai1a-2; a microsymbiont of Andira inermis discovered in Costa Rica

High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Burkholderia sp. strain UYPR1.413.

High-quality permanent draft genome sequence of Bradyrhizobium sp. Tv2a.2, a microsymbiont of Tachigali versicolor discovered in Barro Colorado Island of Panama.