Abstract

Illumina sequencing platforms have been widely used for amplicon-based environmental microbiome research. Analyses of amplicon data of environmental samples, generated from Illumina MiSeq platform illustrate the reverse (R2) reads in the PE datasets to have low quality towards the 3' end of the reads which affect the sequencing depth of samples and ultimately impact the sample size which may possibly lead to an altered outcome. This study evaluates the usefulness of single-end (SE) sequencing data in microbiome research when the Illumina MiSeq PE dataset shows significantly high number of low-quality reverse reads. In this study, the amplicon data (V1V3, V3V4, V4V5 and V6V8) from 128 environmental (soil) samples, downloaded from SRA, demonstrate the efficiency of single-end (SE) sequencing data analyses in microbiome research. The SE datasets were found to infer the core microbiome structure as comparable to the PE dataset. Conspicuously, the forward (R1) datasets inferred a higher number of taxa as compared to PE datasets for most of the amplicon regions, except V3V4. Thus, analyses of SE sequencing data, especially R1 reads, in environmental microbiome studies could ameliorate the problems arising on sample size of the study due to low quality reverse reads in the dataset. However, care must be taken while interpreting the microbiome structure as few taxa observed in the PE datasets were absent in the SE datasets. In conclusion, this study demonstrates the availability of choices in analyzing the amplicon data without having the need to remove samples with low quality reverse reads.

Highlights

  • Amplicon-based microbiome approach has been widely employed to understand the community structure and role of microorganisms in environmental research

  • Analyses of SE sequencing data, especially R1 reads, in environmental microbiome studies could ameliorate the problems arising on sample size of the study due to low quality reverse reads in the dataset

  • The SE sequencing read analyses in microbiome research was evaluated using the data generated from the Illumina MiSeq platform which is widely used for amplicon microbiome research worldwide

Read more

Summary

Introduction

Amplicon-based microbiome approach has been widely employed to understand the community structure and role of microorganisms in environmental research. The Illumina MiSeq platform has advantages such as low cost, flexibility, fast run time and generation of 300 bp long paired-end (PE) sequencing reads (Wen et al 2017; Bharti and Grimm 2021), the platform has some disadvantages. One such disadvantage is having low phred quality towards the 3’ end of the reads (Liu et al 2020). The analyses revealed that the phred quality score of approximately 30–40% of R2 reads is reduced below 20 after ~ 225 bp

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call