Abstract

BackgroundThe broad heterogeneity of antigen-antibody interactions brings tremendous challenges to the design of a widely applicable learning algorithm to identify conformational B-cell epitopes. Besides the intrinsic heterogeneity introduced by diverse species, extra heterogeneity can also be introduced by various data sources, adding another layer of complexity and further confounding the research.ResultsThis work proposed a staged heterogeneity learning method, which learns both characteristics and heterogeneity of data in a phased manner. The method was applied to identify antigenic residues of heterogenous conformational B-cell epitopes based on antigen sequences. In the first stage, the model learns the general epitope patterns of each kind of propensity from a large data set containing computationally defined epitopes. In the second stage, the model learns the heterogenous complementarity of these propensities from a relatively small guided data set containing experimentally determined epitopes. Moreover, we designed an algorithm to cluster the predicted individual antigenic residues into conformational B-cell epitopes so as to provide strong potential for real-world applications, such as vaccine development. With heterogeneity well learnt, the transferability of the prediction model was remarkably improved to handle new data with a high level of heterogeneity. The model has been tested on two data sets with experimentally determined epitopes, and on a data set with computationally defined epitopes. This proposed sequence-based method achieved outstanding performance - about twice that of existing methods, including the sequence-based predictor CBTOPE and three other structure-based predictors.ConclusionsThe proposed method uses only antigen sequence information, and thus has much broader applications.

Highlights

  • The broad heterogeneity of antigen-antibody interactions brings tremendous challenges to the design of a widely applicable learning algorithm to identify conformational B-cell epitopes

  • These two models are used to predict antigenic residues, and an unsupervised clustering algorithm is deployed to cluster predicted antigenic residues into conformational B-cell epitopes

  • Our method was compared with CBTOPE, a sequence-based conformational B-cell epitope predictor [16]

Read more

Summary

Introduction

The broad heterogeneity of antigen-antibody interactions brings tremendous challenges to the design of a widely applicable learning algorithm to identify conformational B-cell epitopes. B-cell epitopes are able to stimulate B-cells to produce neutralizing antibodies, and can be used to design safe vaccines, especially for vulnerable populations such as infants, young children and the elderly, to be immunized against infectious diseases [2]. Their accurate prediction is of great significance, inhibited by several unsolved issues. One serious issue is the broad heterogeneity of epitope data

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call