Abstract

Herpes simplex virus serotype 2 (HSV-2) is a ubiquitous human pathogen that causes recurrent genital infections and ulcerations. Many HSV-2 strains with different biological properties have been identified, but only the genomes of HSV-2 strains HG52, SD90e and 333 have been reported as complete and fully characterized sequences. We de novo assembled, annotated and manually curated the complete genome sequence of HSV-2 strain MS, a highly neurovirulent strain, originally isolated from a multiple sclerosis patient. We resolved both DNA ends, as well as the complex inverted repeats regions present in HSV genomes, usually undisclosed in previous published partial herpesvirus genomes, using long reads from Pacific Biosciences (PacBio) technology. Additionally, we identified isomeric genomes by determining the alternative relative orientation of unique fragments in the genome of the sequenced viral population. Illumina short-read sequencing was crucial to examine genetic variability, such as nucleotide polymorphisms, insertion/deletions and sequence determinants of strain-specific virulence factors. We used Illumina data to fix two disrupted open reading frames found in coding homopolymers after PacBio assembly. These results support the combination of long- and short-read sequencing technologies as a precise and effective approach for the accurate de novo assembly and curation of complex microbial genomes.

Highlights

  • Herpes simplex virus 2 (HSV-2) is a major human pathogen that causes recurrent genital ulcerations after reactivation from the sacral dorsal root ganglia [1]

  • We obtained a total number of 1523339 subreads from Pacific Biosciences (PacBio) sequencing, which were de novo assembled with HGAP4 in default mode, giving three contigs as a result, with the longest being 155 961 bp long

  • We found a duplicated a sequence at the 5′ end of the contig, partially overlapping with a direct repeat (DR) 1 (DR1) sequence of 15 bp before the terminal repeat long (TRL) segment

Read more

Summary

Introduction

Herpes simplex virus 2 (HSV-2) is a major human pathogen that causes recurrent genital ulcerations after reactivation from the sacral dorsal root ganglia [1]. HSV may cause life-t­ hreatening diseases, including disseminated disease in the neonate and herpes simplex encephalitis [2,3,4]. This virus has an estimated global prevalence for adult populations ranging from 11.3 % in 2012 [5] to 13.2 % in 2016 [6], depending on socio-e­ conomic status and country. HSV-2 contains a large, linear double-­stranded DNA (dsDNA) genome of approximately 155 kilobase pairs (kbp) and is divided into unique long (UL) and unique short (US) regions, which are flanked by inverted repeats. Some of them have been laboratory-­adapted and partially characterized after serial

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call