Abstract

Shotgun metagenomics sequencing is a powerful tool for the characterization of complex biological matrices, enabling analysis of prokaryotic and eukaryotic organisms and viruses in a single experiment, with the possibility of reconstructing de novo the whole metagenome or a set of genes of interest. One of the main factors limiting the use of shotgun metagenomics on wide scale projects is the high cost associated with the approach. We set out to determineif it is possible to use shallow shotgun metagenomics to characterize complex biological matrices while reducing costs. We used a staggered mock community to estimate the optimal threshold for species detection. We measured the variation of several summary statistics simulating a decrease in sequencing depth by randomly subsampling a number of reads. The main statistics that were compared are diversity estimates, species abundance, and ability of reconstructing de novo the metagenome in terms of length and completeness. Our results show that diversity indices of complex prokaryotic, eukaryotic and viral communities can be accurately estimated with 500,000 reads or less, although particularly complex samples may require 1,000,000 reads. On the contrary, any task involving the reconstruction of the metagenome performed poorly, even with the largest simulated subsample (1,000,000 reads). The length of the reconstructed assembly was smaller than the length obtained with the full dataset, and the proportion of conserved genes that were identified in the meta-genome was drastically reduced compared to the full sample. Shallow shotgun metagenomics can be a useful tool to describe the structure of complex matrices, but it is not adequate to reconstruct-even partially-the metagenome.

Highlights

  • Reviewers Francesco Dal Grande We shared the issue of the reviewer regarding the detection of a false positive in the mock community and we further investigated the issue

  • We set out to test the effect of the reduction of sequencing depth in metagenome shotgun sequencing experiments on 1) estimates of diversity and species richness; 2) estimates of species abundance, and 3) completeness of de novo reconstruction of the genome of the species present in complex matrices

  • A possible explanation is that in the used database, genomic sequence of Escherichia coli is classified as Shigella flexneri

Read more

Summary

METHOD ARTICLE

Effect of coverage reduction on metagenome shotgun sequencing studies [version 4; peer review: 2 approved, 2 not approved]. Effect of coverage reduction on species identification and genome reconstruction in complex biological matrices metagenome shotgun high throughput sequencing. Federica Cattonaro 1, Alessandro Spadotto[1], Slobodanka Radovic[1], Fabio Marroni 1,2.

22 Mar 2019 report report report report version 1
Methods
Results
Discussion
Human Microbiome Project Consortium
27. Martin M
36. R Core Team
42. Marroni F
All fonts in Figure 5 are too small and unreadable

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.