Abstract

BackgroundRecent advances in sequencing technologies have driven studies identifying the microbiome as a key regulator of overall health and disease in the host. Both 16S amplicon and whole genome shotgun sequencing technologies are currently being used to investigate this relationship, however, the choice of sequencing technology often depends on the nature and experimental design of the study. In principle, the outputs rendered by analysis pipelines are heavily influenced by the data used as input; it is then important to consider that the genomic features produced by different sequencing technologies may emphasize different results.ResultsIn this work, we use public 16S amplicon and whole genome shotgun sequencing (WGS) data from the same dogs to investigate the relationship between sequencing technology and the captured gut metagenomic landscape in dogs. In our analyses, we compare the taxonomic resolution at the species and phyla levels and benchmark 12 classification algorithms in their ability to accurately identify host phenotype using only taxonomic relative abundance information from 16S and WGS datasets with identical study designs. Our best performing model, a random forest trained by the WGS dataset, identified a species (Bacteroides coprocola) that predominantly contributes to the abundance of leuB, a gene involved in branched chain amino acid biosynthesis; a risk factor for glucose intolerance, insulin resistance, and type 2 diabetes. This trend was not conserved when we trained the model using 16S sequencing profiles from the same dogs.ConclusionsOur results indicate that WGS sequencing of dog microbiomes detects a greater taxonomic diversity than 16S sequencing of the same dogs at the species level and with respect to four gut-enriched phyla levels. This difference in detection does not significantly impact the performance metrics of machine learning algorithms after down-sampling. Although the important features extracted from our best performing model are not conserved between the two technologies, the important features extracted from either instance indicate the utility of machine learning algorithms in identifying biologically meaningful relationships between the host and microbiome community members. In conclusion, this work provides the first systematic machine learning comparison of dog 16S and WGS microbiomes derived from identical study designs.

Highlights

  • Recent advances in sequencing technologies have driven studies identifying the microbiome as a key regulator of overall health and disease in the host

  • We further investigated the shared taxonomies identified between the two technologies with respect to four predominant phyla that are commonplace in the gut microbiome

  • The algorithms we evaluated in this study achieved highly variable accuracy in all metrics while the random forest, our best performing model, identified key functional taxonomies that support biologically meaningful insights into regulatory mechanisms driven by microbiome compositions specific to the diet and weight phenotypes of host samples

Read more

Summary

Introduction

Recent advances in sequencing technologies have driven studies identifying the microbiome as a key regulator of overall health and disease in the host. Both 16S amplicon and whole genome shotgun sequencing technologies are currently being used to investigate this relationship, the choice of sequencing technology often depends on the nature and experimental design of the study. Dysfunction in the gut microbiome has been associated with host diseases including inflammatory bowel disease, obesity, and type 2 diabetes in humans [2,3,4,5]. Metagenomic studies have shown that dogs’ intestinal microbiota and its modification by prebiotics, probiotics, and synbiotics reveal that the dysbiosis network underlying inflammatory bowel disease in dogs differs from that in humans [11, 12], establishing a need for further research to understand the microbial communities present in dog microbiomes

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call