Abstract

Influenza A virus genomes are composed of eight negative sense RNAs. In total, 16 proteins encoded by eight positive sense RNAs were identified. One putative protein coding sequence (PCS) encoded by genomic strand RNA of segment 8 has been previously proposed. In this study, 95,608, 123,965 and 35,699 genomic strand RNA sequences from influenza A viruses from avian, human and mammalian hosts, respectively, were used to identify PCSs encoded by the genomic strand RNAs. In total, 326,069 PCSs with lengths equal to or longer than 80 amino acids were identified and clustered into 270 PCS groups. Twenty of the 270 PCS groups which have greater than 10% proportion in influenza A viruses from avian, human or mammalian hosts were selected for detailed study. Maps of the 20 PCSGs in the influenza A virus genomes were constructed. The proportions of the 20 PCSGs in influenza A viruses from different hosts and serotypes were analyzed. One secretory and five membrane proteins predicted from the PCS groups encoded by genomic strand RNAs of segments 1, 2, 4, 6, 7 and 8 were identified. These results suggest the possibility of the ambisense nature of the influenza A virus genomic RNAs and a potential coding sequence reservoir encoding potential pan proteomes of influenza A viruses.

Highlights

  • Influenza A virus (IAV) genomes are composed of eight negative sense RNAs [1,2]

  • The eight open reading frame (ORF) (PB1, PB2, PA, HA, NP, NA, M1 and NS1) in the eight positive sense RNAs were used as reference reading frames (+1 reading frame) for PCS groups (PCSGs) encoded by eight genomic strand RNAs

  • The length and location of all PCSGs in the genomic strand RNAs are inconsistent between IAV genomes

Read more

Summary

Introduction

Influenza A virus (IAV) genomes are composed of eight negative (genomic) sense RNAs [1,2]. 16 proteins encoded by eight positive sense RNAs have been identified. Three proteins (PB1, PB1-F2 and N40) encoded by the positive sense RNA of segment 2 start at the 1st, 4th and 5th AUG, respectively [3,4,5]. Four proteins (PA, PA-X, PA-N155 and PA-N182) are encoded by the positive sense RNA of segment 3. The PA-X protein is a ribosomal frame-shifting product composed of the N-terminal domain of the PA protein (191 amino acids) and a short C-terminal domain (61 amino acids) that results from a +1 frameshift of the PA open reading frame (ORF) [6,7]. Three proteins (M1, M2 and M42) are encoded by the positive sense RNA of PLOS ONE | DOI:10.1371/journal.pone.0146936. Three proteins (M1, M2 and M42) are encoded by the positive sense RNA of PLOS ONE | DOI:10.1371/journal.pone.0146936 January 13, 2016

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call