Abstract

Burkholderia sensu lato is a collection of closely related genera within the family Burkholderiaceae that includes species of environmental, industrial, biotechnological, and clinical importance. Multiple species within the complex are the source of diverse specialized metabolites, many of which have been identified through genome mining of their biosynthetic gene clusters (BGCs). However, the full, true genomic diversity of these species and genera, and their biosynthetic capacity have not been investigated. This study sought to cluster and classify over 4000 Burkholderia sensu lato genome assemblies into distinct genomic taxa representing named and uncharacterized species. We delineated 235 species groups by average nucleotide identity analyses that formed seven distinct phylogenomic clades, representing the genera of Burkholderia sensu lato: Burkholderia, Paraburkholderia, Trinickia, Caballeronia, Mycetohabitans, Robbsia, and Pararobbisa. A total of 137 genomic taxa aligned with named species possessing a sequenced type strain, while 93 uncharacterized species groups were demarcated. The 95% ANI threshold proved capable of delineating most genomic species and was only increased to resolve several closely related species. These analyses enabled the assessment of species classifications of over 4000 genomes, and the correction of over 400 genome taxonomic assignments in public databases into existing and uncharacterized genomic species groups. These species groups were genome mined for BGCs, their specialized metabolite capacity calculated per species and genus, and the number of distinct BGCs per species estimated through kmer-based de-replication. Mycetohabitans species dedicated a larger proportion of their relatively small genomes to specialized metabolite biosynthesis, while Burkholderia species harbored more BGCs on average per genome and possessed the most distinct BGCs per species compared to the remaining genera. Exploring the hidden genomic diversity of this important multi-genus complex contributes to our understanding of their taxonomy and evolutionary relationships, and supports future efforts toward natural product discovery.

Highlights

  • The multi-genus complex known as Burkholderia sensu lato has undergone multiple expansions and refinements following the emergence and description of the genus Burkholderia in 1992 (Yabuuchi et al, 1992; Figure 1A)

  • Using the wrapper download_genomes.sh script, all genomes assemblies of the genera Burkholderia, Paraburkholderia, Caballeronia, Trinickia, Mycetohabitans, Robbsia, and Pararobbsia were downloaded from National Center for Biotechnology Information (NCBI) (July 2020) using the ncbi-genome-download v0.2.112, and kmer sketches generated with Mash v2.2.2 (Ondov et al, 2016), respectively

  • While the seven Burkholderia sensu lato genera: Burkholderia, Paraburkholderia, Trinickia, Caballeronia, Mycetohabitans, Robbsia, and Pararobbisa, were closely related to each other, they occupied a clade alongside three genera not currently recognized as Burkholderia sensu lato: Chitinasiproducens, Mycoavidus, and “Candidatus Glomeribacter” (Figure 1B)

Read more

Summary

Introduction

The multi-genus complex known as Burkholderia sensu lato has undergone multiple expansions and refinements following the emergence and description of the genus Burkholderia in 1992 (Yabuuchi et al, 1992; Figure 1A). The most recent addition to the Burkholderia sensu lato was the description of the genus Pararobbsia (Lin et al, 2020) comprised of two species closely related to Robbsia. The original gold standard for species delineation was DNA-DNA hybridisation (DDH) with a species threshold set at 70% hybridization between bacterial genomes (Goris et al, 2007; Richter and Rosselló-Móra, 2009). While similar, targeted efforts were recently applied to understand and consolidate the existing genomic sequence data of the Burkholderia cepacia complex, a sub-group of closely related species within Burkholderia, on a curated collection of 116 high-quality or taxonomically significant genomes (Jin et al, 2020)

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call