Abstract

Simple SummaryAfrican swine fever (ASF) is one of the most important animal diseases affecting the domestic swine population globally. Whole-genome sequence analysis on the circulating African swine fever virus (ASFV) strains would provide valuable information in tracking the outbreaks of the disease. The aim of this study was to prepare a curated dataset of ASFV genome sequences and investigate genome-wide diversity of circulating ASFV strains. We prepared a curated dataset containing 123 high-quality ASFV genome sequences representing 10 genotypes collected from 28 countries between 1949 and 2020. Phylogenetic analysis based on whole-genome sequences provided high-resolution topology in genotyping ASFV isolates, which was supported by pairwise genome sequence similarity comparison. Wide distribution and high variation of tandem repeat sequences were found in ASFV genomes. Structural variation and highly variable poly G or poly C tracts were also identified. This study improved our understanding on the patterns of genetic variation of ASFV and facilitated future studies on ASFV molecular epidemiology.African swine fever (ASF) is a lethal contagious viral disease of domestic pigs and wild boars caused by the African swine fever virus (ASFV). The pandemic spread of ASF has had serious effects on the global pig industry. Virus genome sequencing and comparison play an important role in tracking the outbreaks of the disease and tracing the transmission of the virus. Although more than 140 ASFV genome sequences have been deposited in the public databases, the genome-wide diversity of ASFV remains unclear. Here we prepared a curated dataset of ASFV genome sequences by filtering genomes with sequencing errors as well as duplicated genomes. A total of 123 ASFV genome sequences were included in the dataset, representing 10 genotypes collected between 1949 and 2020. Phylogenetic analysis based on whole-genome sequences provided high-resolution topology in differentiating closely related ASFV isolates, and drew new clues in the classification of some ASFV isolates. Genome-wide diversity of ASFV genomes was explored by pairwise sequence similarity comparison and ORF distribution comparison. Tandem repeat sequences were found widely distributed and highly varied in ASFV genomes. Structural variation and highly variable poly G or poly C tracts also contributed to the genome diversity. This study expanded our knowledge on the patterns of genetic diversity and evolution of ASFV, and provided valuable information for diagnosis improvement and vaccine development.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call