Long-read Metagenomics Research Articles

Microbial secondary metabolites play crucial roles in microbial competition, communication, resource acquisition, antibiotic production, and a variety of other biotechnological processes. The retrieval of full-length BGC (biosynthetic gene cluster) sequences from uncultivated bacteria is difficult due to the technical constraints of short-read sequencing, making it impossible to determine BGC diversity. Using long-read sequencing and genome mining, 339 mainly full-length BGCs were recovered in this study, illuminating the wide range of BGCs from uncultivated lineages discovered in seawater from Aoshan Bay, Yellow Sea, China. Many extremely diverse BGCs were discovered in bacterial phyla such as Proteobacteria, Bacteroidota, Acidobacteriota, and Verrucomicrobiota as well as the previously uncultured archaeal phylum "Candidatus Thermoplasmatota." The data from metatranscriptomics showed that 30.1% of secondary metabolic genes were being expressed, and they also revealed the expression pattern of BGC core biosynthetic genes and tailoring enzymes. Taken together, our results demonstrate that long-read metagenomic sequencing combined with metatranscriptomic analysis provides a direct view into the functional expression of BGCs in environmental processes. IMPORTANCE Genome mining of metagenomic data has become the preferred method for the bioprospecting of novel compounds by cataloguing secondary metabolite potential. However, the accurate detection of BGCs requires unfragmented genomic assemblies, which have been technically difficult to obtain from metagenomes until recently with new long-read technologies. We used high-quality metagenome-assembled genomes generated from long-read data to determine the biosynthetic potential of microbes found in the surface water of the Yellow Sea. We recovered 339 highly diverse and mostly full-length BGCs from largely uncultured and underexplored bacterial and archaeal phyla. Additionally, we present long-read metagenomic sequencing combined with metatranscriptomic analysis as a potential method for gaining access to the largely underutilized genetic reservoir of specialized metabolite gene clusters in the majority of microbes that are not cultured. The combination of long-read metagenomic and metatranscriptomic analyses is significant because it can more accurately assess the mechanisms of microbial adaptation to the environment through BGC expression based on metatranscriptomic data.

ABSTRACTThe recovery of DNA from viromes is a major obstacle in the use of long-read sequencing to study their genomes. For this reason, the use of cellular metagenomes (>0.2-μm size range) emerges as an interesting complementary tool, since they contain large amounts of naturally amplified viral genomes from prelytic replication. We have applied second-generation (Illumina NextSeq; short reads) and third-generation (PacBio Sequel II; long reads) sequencing to compare the diversity and features of the viral community in a marine sample obtained from offshore waters of the western Mediterranean. We found that a major wedge of the expected marine viral diversity was directly recovered by the raw PacBio circular consensus sequencing (CCS) reads. More than 30,000 sequences were detected only in this data set, with no homologues in the long- and short-read assembly, and ca. 26,000 had no homologues in the large data set of the Global Ocean Virome 2 (GOV2), highlighting the information gap created by the assembly bias. At the level of complete viral genomes, the performance was similar in both approaches. However, the hybrid long- and short-read assembly provided the longest average length of the sequences and improved the host assignment. Although no novel major clades of viruses were found, there was an increase in the intraclade genomic diversity recovered by long reads that produced an enriched assessment of the real diversity and allowed the discovery of novel genes with biotechnological potential (e.g., endolysin genes).IMPORTANCE We explored the vast genetic diversity of environmental viruses by using a combination of cellular metagenome (as opposed to virome) sequencing using high-fidelity long-read sequences (in this case, PacBio CCS). This approach resulted in the recovery of a representative sample of the viral population, and it performed better (more phage contigs, larger average contig size) than Illumina sequencing applied to the same sample. By this approach, the many biases of assembly are avoided, as the CCS reads recovers (typically around 5 kb) complete genes and even operons, resulting in a better discovery of the viral gene diversity based on viral marker proteins. Thus, biotechnologically promising genes, such as endolysin genes, can be very efficiently searched with this approach. In addition, hybrid assembly produces more complete and longer contigs, which is particularly important for studying little-known viral groups such as the nucleocytoplasmic large DNA viruses (NCLDV).

Long-read Metagenomics Research Articles

Related Topics

Articles published on Long-read Metagenomics

Taxometer: Improving taxonomic classification of metagenomics contigs

Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses.

Metagenomic evaluation of bacteria in drinking water using full-length 16S rRNA amplicons.

The drainome: longitudinal metagenomic characterization of wastewater from hospital ward sinks to characterize the microbiome and resistome and to assess the effects of decontamination interventions

A microbiome survey of contrasting potato terroirs using 16S rRNA long-read sequencing

Towards facilitated interpretation of shotgun metagenomics long-read sequencing data analyzed with KMA for the detection of bacterial pathogens and their antimicrobial resistance genes.

Sensitivity and consistency of long- and short-read metagenomics and epicPCR for the detection of antibiotic resistance genes and their bacterial hosts in wastewater

Insights into gut microbiomes in stem cell transplantation by comprehensive shotgun long-read sequencing

Spatiotemporal dynamics of giant viruses within a deep freshwater lake reveal a distinct dark-water community.

Long-Read Metagenomics of Marine Microbes Reveals Diversely Expressed Secondary Metabolites.

Exploring Long-Read Metagenomics for Full Characterization of Shiga Toxin-Producing Escherichia coli in Presence of Commensal E. coli.

A novel and diverse group of Candidatus Patescibacteria from bathypelagic Lake Baikal revealed through long-read metagenomics

Long-Read Metagenomics and CAZyme Discovery.

Long-read metagenomics paves the way toward a complete microbial tree of life.

A step forward for Shiga toxin-producing Escherichia coli identification and characterization in raw milk using long-read metagenomics.

Insights into ecological roles of uncultivated bacteria in Katase hot spring sediment from long-read metagenomics.

Nanopore-based long-read metagenomics uncover the resistome intrusion by antibiotic resistant bacteria from treated wastewater in receiving water body

Long-Read Metagenomics Improves the Recovery of Viral Diversity from Complex Natural Marine Samples.

Short- and long-read metagenomics expand individualized structural variations in gut microbiomes.

Investigating plant disease outbreaks with long-read metagenomics: sensitive detection and highly resolved phylogenetic reconstruction applied to Xylella fastidiosa.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Long-read Metagenomics Research Articles

Related Topics

Articles published on Long-read Metagenomics

Taxometer: Improving taxonomic classification of metagenomics contigs

Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses.

Metagenomic evaluation of bacteria in drinking water using full-length 16S rRNA amplicons.

The drainome: longitudinal metagenomic characterization of wastewater from hospital ward sinks to characterize the microbiome and resistome and to assess the effects of decontamination interventions

A microbiome survey of contrasting potato terroirs using 16S rRNA long-read sequencing

Towards facilitated interpretation of shotgun metagenomics long-read sequencing data analyzed with KMA for the detection of bacterial pathogens and their antimicrobial resistance genes.

Sensitivity and consistency of long- and short-read metagenomics and epicPCR for the detection of antibiotic resistance genes and their bacterial hosts in wastewater

Insights into gut microbiomes in stem cell transplantation by comprehensive shotgun long-read sequencing

Spatiotemporal dynamics of giant viruses within a deep freshwater lake reveal a distinct dark-water community.

Long-Read Metagenomics of Marine Microbes Reveals Diversely Expressed Secondary Metabolites.

Exploring Long-Read Metagenomics for Full Characterization of Shiga Toxin-Producing Escherichia coli in Presence of Commensal E. coli.

A novel and diverse group of Candidatus Patescibacteria from bathypelagic Lake Baikal revealed through long-read metagenomics

Long-Read Metagenomics and CAZyme Discovery.

Long-read metagenomics paves the way toward a complete microbial tree of life.

A step forward for Shiga toxin-producing Escherichia coli identification and characterization in raw milk using long-read metagenomics.

Insights into ecological roles of uncultivated bacteria in Katase hot spring sediment from long-read metagenomics.

Nanopore-based long-read metagenomics uncover the resistome intrusion by antibiotic resistant bacteria from treated wastewater in receiving water body

Long-Read Metagenomics Improves the Recovery of Viral Diversity from Complex Natural Marine Samples.

Short- and long-read metagenomics expand individualized structural variations in gut microbiomes.

Investigating plant disease outbreaks with long-read metagenomics: sensitive detection and highly resolved phylogenetic reconstruction applied to Xylella fastidiosa.