In recent years, there has been a surge in metagenomic studies focused on identifying plasmids in environmental samples. Although these studies have unearthed numerous novel plasmids, enriching our understanding of their environmental roles, a significant gap remains: the scarcity of information regarding the bacterial hosts of these newly discovered plasmids. Furthermore, even when plasmids are identified within bacterial isolates, the reported host is typically limited to the original isolate, with no insights into alternative hosts or the plasmid's potential host range. Given that plasmids depend on hosts for their existence, investigating plasmids without the knowledge of potential hosts offers only a partial perspective. This study introduces a method for identifying potential hosts and host ranges for plasmids through alignment with CRISPR spacers. To validate the method, we compared the PLSDB plasmids database with the CRISPR spacers database, yielding host predictions for 46% of the plasmids. When compared with reported hosts, our predictions achieved 84% concordance at the family level and 99% concordance at the phylum level. Moreover, the method frequently identified multiple potential hosts for a plasmid, thereby enabling predictions of alternative hosts and the host range. Notably, we found that CRISPR spacers predominantly target plasmid backbone genes while sparing functional genes, such as those linked to antibiotic resistance, aligning with our hypothesis that CRISPR spacers are acquired from plasmid-specific regions rather than insertion elements from diverse sources. Finally, we illustrate the network of connections among different bacterial taxa through plasmids, revealing potential pathways for horizontal gene transfer.IMPORTANCEPlasmids are notorious for their role in distributing antibiotic resistance genes, but they may also carry and distribute other environmentally important genes. Since plasmids are not free-living entities and rely on host bacteria for survival and propagation, predicting their hosts is essential. This study presents a method for predicting potential hosts for plasmids and offers insights into the potential paths for spreading functional genes between different bacteria. Understanding plasmid-host relationships is crucial for comprehending the ecological and clinical impact of plasmids and implications for various biological processes.
Read full abstract