Caused by both eukaryotic dinoflagellates and prokaryotic cyanobacteria, harmful algal blooms (HABs) are events of severe ecological, economic, and public health consequence, and their incidence has become more common of late. Despite coordinated research efforts to identify and characterize the genomes of HAB-causing organisms, the genomic basis and evolutionary origins of paralytic shellfish toxins (PSTs) produced by HABs remain at best incomplete. The PST saxitoxin has an especially complex genomic architecture and enigmatic phylogenetic distribution, spanning dinoflagellates and multiple cyanobacterial genera. Using filtration and extraction techniques to target the desired cyanobacteria from non-axenic culture, coupled with a combination of short and long read sequencing, we generated a reference-quality hybrid genome assembly for Heteroscytonema crispum UTEX LB 1556, a freshwater, PST-producing cyanobacterium thought to have the largest known genome in its phylum. We report a complete, novel biosynthetic gene cluster for the PST saxitoxin. Leveraging this biosynthetic gene cluster, we find support for the hypothesis that PST production has appeared in divergent Cyanobacteria lineages through widespread and repeated horizontal gene transfer. This work demonstrates the utility of long-read sequencing and metagenomic assembly toward advancing our understanding of PST biosynthetic gene cluster diversity and suggests a mechanism for the origin of PST biosynthetic genes.
Read full abstract