Abstract
Whole-genome sequencing (WGS) of viruses from patient or environmental samples can provide tremendous insight into the epidemiology, drug resistance or evolution of a virus. However, we face two common hurdles in obtaining robust sequence information; the low copy number of viral genomes in specimens and the error introduced by WGS techniques. To optimize detection and minimize error in WGS of hantaviruses, we tested four amplification approaches and different amplicon pooling methods for library preparation and examined these preparations using two sequencing platforms, Illumina MiSeq and Oxford Nanopore Technologies MinION. First, we tested and optimized primers used for whole segment PCR or one kilobase amplicon amplification for even coverage using RNA isolated from the supernatant of virus-infected cells. Once optimized we assessed two sources of total RNA, virus-infected cells and supernatant from the virus-infected cells, with four variations of primer pooling for amplicons, and six different amplification approaches. We show that 99–100% genome coverage was obtained using a one-step RT-PCR reaction with one forward and reverse primer. Using a two-step RT-PCR with three distinct tiling approaches for the three genomic segments (vRNAs), we optimized primer pooling approaches for PCR amplification to achieve a greater number of aligned reads, average depth of genome, and genome coverage. The single nucleotide polymorphisms identified from MiSeq and MinION sequencing suggested intrinsic mutation frequencies of ~10−5-10−7 per genome and 10−4-10−5 per genome, respectively. We noted no difference in the coverage or accuracy when comparing WGS results with amplicons amplified from RNA extracted from infected cells or supernatant of these infected cells. Our results show that high-throughput diagnostics requiring the identification of hantavirus species or strains can be performed using MiSeq or MinION using a one-step approach. However, the two-step MiSeq approach outperformed the MinION in coverage depth and accuracy, and hence would be superior for assessment of genomes for epidemiology or evolutionary questions using the methods developed herein.
Highlights
Hantaviruses, genus Orthohantavirus, family Hantaviridae, order Bunyavirales, are broadly distributed in nature in Old and New World rodents (Peters and Khan, 2002; Jonsson et al, 2010; Vaheri et al, 2013)
Spillover of several of these viruses from rodents to humans through inhalation of aerosolized excreta can cause serious illness resulting in hantavirus pulmonary syndrome (HPS) in the Americas or hemorrhagic fever with renal syndrome (HFRS) in Europe and Asia with case fatality rates ranging from 1–40% (Peters and Khan, 2002)
Diagnostic testing for patients suspected of having HFRS or HPS typically calls for qRT-PCR detection based on hantavirus S segment from blood, which does not typically discriminate among the viral vRNA or complementary RNA (cRNA)/messenger RNA (mRNA) (Terajima et al, 1999; Evander et al, 2007), and/or detection through IgM antibody capture (Le Duc et al, 1990; Ksiazek et al, 1995; Hujakka et al, 2003)
Summary
Hantaviruses, genus Orthohantavirus, family Hantaviridae, order Bunyavirales, are broadly distributed in nature in Old and New World rodents (Peters and Khan, 2002; Jonsson et al, 2010; Vaheri et al, 2013). Diagnostic testing for patients suspected of having HFRS or HPS typically calls for qRT-PCR detection based on hantavirus S segment from blood, which does not typically discriminate among the viral vRNA or cRNA/mRNA (Terajima et al, 1999; Evander et al, 2007), and/or detection through IgM antibody capture (Le Duc et al, 1990; Ksiazek et al, 1995; Hujakka et al, 2003) Neither of these approaches provides information regarding the specific strain or genotype of hantavirus, whether the virus represents a new sequence variant or whether the infection represents a new reassortment of the genomes. With these gaps in mind, we designed, tested and evaluated several WGS approaches and methods with the goal of creating a pipeline that would result in high quality sequences for the purpose of studies of ecology, evolution, or molecular epidemiology
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have