Abstract

The contemporary surge in metagenomic sequencing has transformed knowledge of viral diversity in wildlife. However, evaluating which newly discovered viruses pose sufficient risk of infecting humans to merit detailed laboratory characterization and surveillance remains largely speculative. Machine learning algorithms have been developed to address this imbalance by ranking the relative likelihood of human infection based on viral genome sequences, but are not yet routinely applied to viruses at the time of their discovery. Here, we characterized viral genomes detected through metagenomic sequencing of feces and saliva from common vampire bats (Desmodus rotundus) and used these data as a case study in evaluating zoonotic potential using molecular sequencing data. Of 58 detected viral families, including 17 which infect mammals, the only known zoonosis detected was rabies virus; however, additional genomes were detected from the families Hepeviridae, Coronaviridae, Reoviridae, Astroviridae and Picornaviridae, all of which contain human-infecting species. In phylogenetic analyses, novel vampire bat viruses most frequently grouped with other bat viruses that are not currently known to infect humans. In agreement, machine learning models built from only phylogenetic information ranked all novel viruses similarly, yielding little insight into zoonotic potential. In contrast, genome composition-based machine learning models estimated different levels of zoonotic potential, even for closely related viruses, categorizing one out of four detected hepeviruses and two out of three picornaviruses as having high priority for further research. We highlight the value of evaluating zoonotic potential beyond ad hoc consideration of phylogeny and provide surveillance recommendations for novel viruses in a wildlife host which has frequent contact with humans and domestic animals.

Highlights

  • As a positive control for the machine learning analyses, we evaluated a published genome of vampire bat RABV (Genbank accession EU293133) which was most similar to lineages circulating in Peru as determined by a nucleotide blast against Genbank

  • The quantitative risk assessments required for this are still in their infancy, we demonstrate a case study using both phylogenetics and machine learning models to initially assess viruses detected in vampire bats, a wildlife species in close and frequent contact with humans and domestic animals

  • The viruses characterized in our study represent a snapshot of the current viral community in vampire bats, and the genome composition-based (GCB) model allows us to prioritize those viruses

Read more

Summary

Introduction

While discoveries of novel viruses in wildlife are valuable to understand the host range and distribution of viruses across the tree of life, evaluating the risk of human infection from viral sequence data alone remains largely subjective, often relying on the presence of zoonotic viruses in the same viral family or evolutionary relatedness to known zoonoses or to human viruses. Such projections may not always be accurate for several reasons. These models, built from exclusively genomic data, have particular potential to contribute to evaluating which viruses merit further study in empirical metagenomic datasets, where the only source of information is genomic data

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call