Abstract

Swine influenza viruses (SIVs) can unforeseeably cross the species barriers and directly infect humans, which pose huge challenges for public health and trigger pandemic risk at irregular intervals. Computational tools are needed to predict infection phenotype and early pandemic risk of SIVs. For this purpose, we propose a feature representation algorithm to predict cross-species infection of SIVs. We built a high-quality dataset of 1902 viruses. A feature representation learning scheme was applied to learn feature representations from 64 well-trained random forest models with multiple feature descriptors of mutant amino acid in the viral proteins, including compositional information, position-specific information, and physicochemical properties. Class and probabilistic information were integrated into the feature representations, and redundant features were removed by feature space optimization. High performance was achieved using 20 informative features and 22 probabilistic information. The proposed method will facilitate SIV characterization of transmission phenotype.

Highlights

  • The influenza A virus genome contains eight segmental RNAs [1]

  • The 769 human viruses were considered as positive samples because they were verified to have the ability of infection among humans

  • After ranking by the minimum redundance maximum relevance (mRMR) algorithm, the performance based on the optimal probabilistic information increased from 95.95% to 96.37% for ACC and from 91.59% to 92.46% for Mathew’s correlation coefficient (MCC), and the performance based on class information increased from 95.22% to 95.69%

Read more

Summary

Introduction

The influenza A virus (family: Orthomyxoviridae) genome contains eight segmental RNAs [1]. Besides the fast mutation rates of viral genes, segmental reassortments of viral genomes facilitate the occurrence of novel virus with the changes of host barriers [5, 6]. The 2009 H1N1 pandemic killed more than 18,000 people, and the viral pathogen during the pandemic was a novel SIV that was produced by genome reassortment between genomereassorted swine viruses from America and Europe [11,12,13]. H3N2 variant viruses (H3N2v), which have segment 7 from the 2009 H1N1 pandemic virus in their genomes, were identified in swine in 2010 and first detected in people in 2011 [14]. SIVs are a huge challenge for human public health and may trigger pandemic risk

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call