Abstract

Sequencing of DNA extracted from environmental samples can provide key insights into the biosynthetic potential of uncultured bacteria. However, the high complexity of soil metagenomes, which can contain thousands of bacterial species per gram of soil, imposes significant challenges to explore secondary metabolites potentially produced by rare members of the soil microbiome. Here, we develop a targeted sequencing workflow termed CONKAT-seq (co-occurrence network analysis of targeted sequences) that detects physically clustered biosynthetic domains, a hallmark of bacterial secondary metabolism. Following targeted amplification of conserved biosynthetic domains in a highly partitioned metagenomic library, CONKAT-seq evaluates amplicon co-occurrence patterns across library subpools to identify chromosomally clustered domains. We show that a single soil sample can contain more than a thousand uncharacterized biosynthetic gene clusters, most of which originate from low frequency genomes which are practically inaccessible through untargeted sequencing. CONKAT-seq allows scalable exploration of largely untapped biosynthetic diversity across multiple soils, and can guide the discovery of novel secondary metabolites from rare members of the soil microbiome.

Highlights

  • Sequencing of DNA extracted from environmental samples can provide key insights into the biosynthetic potential of uncultured bacteria

  • CONKAT-seq relies on the statistical analysis of amplicon co-occurrence across hundreds of library subpools to identify networks of physically clustered biosynthetic domains, and uses these predictions to point towards novel biosynthetic gene clusters (BGCs) encoded in a metagenome (Supplementary Fig. 1)

  • Our CONKAT-seq analysis suggests that a large fraction of the biosynthetic diversity in soil metagenomes is encoded by low frequency BGCs that have so-far remained inaccessible to untargeted sequencing efforts

Read more

Summary

Introduction

Sequencing of DNA extracted from environmental samples can provide key insights into the biosynthetic potential of uncultured bacteria. The high complexity of soil metagenomes, which can contain thousands of bacterial species per gram of soil, imposes significant challenges to explore secondary metabolites potentially produced by rare members of the soil microbiome. Despite the unparalleled sensitivity of PCR based experiments, critical information regarding the clustering of biosynthetic domains into pathways is inherently lost in the resulting heterogeneous mixture of single domain amplicons (Fig. 1a). To address this shortcoming and enable a more informative exploration of rare BGCs in soil metagenomes, we sought to develop a sequencing workflow that would reconstruct the chromosomal organization of biosynthetic domains into clusters from amplicon sequencing data

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call