Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin

Lei Li,Xiaojian Zhang,Deborah Chang,Xiu-Feng Wan,Joseph Zaia,Lei Han

doi:10.1186/s12859-020-3527-5

Lei Li, Xiaojian Zhang + Show 4 more

Open Access

https://doi.org/10.1186/s12859-020-3527-5

Copy DOI

Abstract

BackgroundIn addition to causing the pandemic influenza outbreaks of 1918 and 2009, subtype H1N1 influenza A viruses (IAVs) have caused seasonal epidemics since 1977. Antigenic property of influenza viruses are determined by both protein sequence and N-linked glycosylation of influenza glycoproteins, especially hemagglutinin (HA). The currently available computational methods are only considered features in protein sequence but not N-linked glycosylation.ResultsA multi-task learning sparse group least absolute shrinkage and selection operator (LASSO) (MTL-SGL) regression method was developed and applied to derive two types of predominant features including protein sequence and N-linked glycosylation in hemagglutinin (HA) affecting variations in serologic data for human and swine H1N1 IAVs. Results suggested that mutations and changes in N-linked glycosylation sites are associated with the rise of antigenic variants of H1N1 IAVs. Furthermore, the implicated mutations are predominantly located at five reported antibody-binding sites, and within or close to the HA receptor binding site. All of the three N-linked glycosylation sites (i.e. sequons NCSV at HA 54, NHTV at HA 125, and NLSK at HA 160) identified by MTL-SGL to determine antigenic changes were experimentally validated in the H1N1 antigenic variants using mass spectrometry analyses. Compared with conventional sparse learning methods, MTL-SGL achieved a lower prediction error and higher accuracy, indicating that grouped features and MTL in the MTL-SGL method are not only able to handle serologic data generated from multiple reagents, supplies, and protocols, but also perform better in genetic sequence-based antigenic quantification.ConclusionsIn summary, the results of this study suggest that mutations and variations in N-glycosylation in HA caused antigenic variations in H1N1 IAVs and that the sequence-based antigenicity predictive model will be useful in understanding antigenic evolution of IAVs.

Highlights

In addition to causing the pandemic influenza outbreaks of 1918 and 2009, subtype H1N1 influenza A viruses (IAVs) have caused seasonal epidemics since 1977
In summary, the results of this study suggest that mutations and variations in N-glycosylation in HA caused antigenic variations in H1N1 IAVs and that the sequence-based antigenicity predictive model will be useful in understanding antigenic evolution of IAVs
We developed a multi-task learning sparse group least absolute shrinkage and selection operator (LASSO) (MTL-SGL) machine-learning model to assess antigenic changes in human, swine, and avian H1N1 IAVs

Summary

Introduction

In addition to causing the pandemic influenza outbreaks of 1918 and 2009, subtype H1N1 influenza A viruses (IAVs) have caused seasonal epidemics since 1977. Antigenic property of influenza viruses are determined by both protein sequence and N-linked glycosylation of influenza glycoproteins, especially hemagglutinin (HA). Two of four documented influenza pandemics (in 1918 and 2009) were caused by subtype H1N1 IAVs, and the 1918 resulting in > 40 million human deaths worldwide [2,3,4,5]. H1N1 IAVs have been a predominant cause of seasonal influenza outbreaks between 1918 to 1957 and since 1977. Sequence analyses showed numerous mutations in the HA of these A(H1N1)season1977 and A(H1N1)pdm viruses, including mutations in antibody binding sites and glycosylation sites [7]. Serologic characterization suggested that A(H1N1)pdm1918 has a low level of cross-reactivity with A(H1N1)pdm and that A(H1N1)season1977 and A(H1N1)pdm do not cross-react with each other [8,9,10,11]

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: May 11, 2020
Citations: 12	License type: open-access

R Discovery Prime

R Discovery Prime

Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Identification of human vaccinees that possess antibodies targeting the egg-adapted hemagglutinin receptor binding site of an H1N1 influenza vaccine strain
Tyler A Garretson ... Scott E Hensley
Vaccine | VOL. 36
Tyler A Garretson, et. al.Tyler A Garretson ... Scott E Hensley
31 May 2018
Vaccine | VOL. 36

Identification of Critical Residues in the Hemagglutinin and Neuraminidase of Influenza Virus H1N1pdm for Vaccine Virus Replication in Embryonated Chicken Eggs
Weijia Wang ... Janine Lu
Journal of Virology | VOL. 87
Weijia Wang, et. al.Weijia Wang ... Janine Lu
13 Feb 2013
Journal of Virology | VOL. 87

Altering the Immunogenicity of Hemagglutinin Immunogens by Hyperglycosylation and Disulfide Stabilization.
Dana N Thornlow ... Erica L Stover
Frontiers in immunology | VOL. 12
Dana N Thornlow, et. al.Dana N Thornlow ... Erica L Stover
07 Oct 2021
Frontiers in immunology | VOL. 12

Glycan Microarray Analysis of the Hemagglutinins from Modern and Pandemic Influenza Viruses Reveals Different Receptor Specificities
James Stevens ... Ian A Wilson
Journal of Molecular Biology | VOL. 355
James Stevens, et. al.James Stevens ... Ian A Wilson
18 Nov 2005
Journal of Molecular Biology | VOL. 355

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics