Abstract

BackgroundIdentification of acute or recent hepatitis C virus (HCV) infections is important for detecting outbreaks and devising timely public health interventions for interruption of transmission. Epidemiological investigations and chemistry-based laboratory tests are 2 main approaches that are available for identification of acute HCV infection. However, owing to complexity, both approaches are not efficient. Here, we describe a new sequence alignment-free method to discriminate between recent (R) and chronic (C) HCV infection using next-generation sequencing (NGS) data derived from the HCV hypervariable region 1 (HVR1).ResultsUsing dinucleotide auto correlation (DAC), we identified physical-chemical (PhyChem) features of HVR1 variants. Significant (p < 9.58 × 10−4) differences in the means and frequency distributions of PhyChem features were found between HVR1 variants sampled from patients with recent vs chronic (R/C) infection. Moreover, the R-associated variants were found to occupy distinct and discrete PhyChem spaces. A radial basis function neural network classifier trained on the PhyChem features of intra-host HVR1 variants accurately classified R/C-HVR1 variants (classification accuracy (CA) = 94.85%; area under the ROC curve, AUROC = 0.979), in 10-fold cross-validation). The classifier was accurate in assigning individual HVR1 variants to R/C-classes in the testing set (CA = 84.15%; AUROC = 0.912) and in detection of infection duration (R/C-class) in patients (CA = 88.45%). Statistical tests and evaluation of the classifier on randomly-labeled datasets indicate that classifiers’ CA is robust (p < 0.001) and unlikely due to random correlations (CA = 59.04% and AUROC = 0.50).ConclusionsThe PhyChem features of intra-host HVR1 variants are strongly associated with the duration of HCV infection. Application of the PhyChem biomarkers to models for detection of the R/C-state of HCV infection in patients offers a new opportunity for detection of outbreaks and for molecular surveillance. The method will be available at https://webappx.cdc.gov/GHOST/ to the authenticated users of Global Hepatitis Outbreak and Surveillance Technology (GHOST) for further testing and validation.

Highlights

  • Identification of acute or recent hepatitis C virus (HCV) infections is important for detecting outbreaks and devising timely public health interventions for interruption of transmission

  • We describe the application of the DNA dinucleotide-based autocovariance (DAC) method to effectively identify relevant PhyChem features of hypervariable region 1 (HVR1) variants, and the implementation of a radial basis function neural network (RBFNN) classifier to discriminate between Rand C-associated intra-host HVR1 variants without need of multiple sequence alignment (MSA) prior to the classification test

  • HVR1 PhyChem features associated with R/C states The Welch’s t-test was used to examine variances in nt and DAC-based PhyChem features in HVR1 sequence data obtained from R (n = 124) and C (n = 98) patients

Read more

Summary

Introduction

Identification of acute or recent hepatitis C virus (HCV) infections is important for detecting outbreaks and devising timely public health interventions for interruption of transmission. Epidemiological investigations and chemistry-based laboratory tests are 2 main approaches that are available for identification of acute HCV infection. Accurate identification of acute or recent hepatitis C infection is essential for identification of outbreaks and for devising timely public health interventions to interrupt transmissions. Epidemiological investigation allows for the detection of recent infection. Epidemiological support may be limited, and information on duration of HCV infection may not be available. Detection of seroconversion is time-consuming, and avidity tests are not broadly available, rendering both approaches of impractical for surveillance. There are not costeffective and reliable methods suitable for large-scale identification of recently acquired HCV infection

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call