Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based feature

Wen Zhang,Yi Xiong,Hua Zou,Juan Liu,Meng Zhao,Xinghuo Ye

doi:10.1186/1471-2105-12-341

Wen Zhang, Yi Xiong + Show 4 more

Open Access

https://doi.org/10.1186/1471-2105-12-341

Copy DOI

Journal: BMC Bioinformatics	Publication Date: Aug 17, 2011
Citations: 128	License type: CC BY 2.0

Affiliation: Wuhan University

Abstract

BackgroundAntigen-antibody interactions are key events in immune system, which provide important clues to the immune processes and responses. In Antigen-antibody interactions, the specific sites on the antigens that are directly bound by the B-cell produced antibodies are well known as B-cell epitopes. The identification of epitopes is a hot topic in bioinformatics because of their potential use in the epitope-based drug design. Although most B-cell epitopes are discontinuous (or conformational), insufficient effort has been put into the conformational epitope prediction, and the performance of existing methods is far from satisfaction.ResultsIn order to develop the high-accuracy model, we focus on some possible aspects concerning the prediction performance, including the impact of interior residues, different contributions of adjacent residues, and the imbalanced data which contain much more non-epitope residues than epitope residues. In order to address above issues, we take following strategies. Firstly, a concept of 'thick surface patch' instead of 'surface patch' is introduced to describe the local spatial context of each surface residue, which considers the impact of interior residue. The comparison between the thick surface patch and the surface patch shows that interior residues contribute to the recognition of epitopes. Secondly, statistical significance of the distance distribution difference between non-epitope patches and epitope patches is observed, thus an adjacent residue distance feature is presented, which reflects the unequal contributions of adjacent residues to the location of binding sites. Thirdly, a bootstrapping and voting procedure is adopted to deal with the imbalanced dataset. Based on the above ideas, we propose a new method to identify the B-cell conformational epitopes from 3D structures by combining conventional features and the proposed feature, and the random forest (RF) algorithm is used as the classification engine. The experiments show that our method can predict conformational B-cell epitopes with high accuracy. Evaluated by leave-one-out cross validation (LOOCV), our method achieves the mean AUC value of 0.633 for the benchmark bound dataset, and the mean AUC value of 0.654 for the benchmark unbound dataset. When compared with the state-of-the-art prediction models in the independent test, our method demonstrates comparable or better performance.ConclusionsOur method is demonstrated to be effective for the prediction of conformational epitopes. Based on the study, we develop a tool to predict the conformational epitopes from 3D structures, available at http://code.google.com/p/my-project-bpredictor/downloads/list.

Highlights

Antigen-antibody interactions are key events in immune system, which provide important clues to the immune processes and responses
We develop a novel method for predicting B-cell conformational epitopes by using the random forest (RF) algorithm with the combination of the adjacent residue distance feature and several conventional features
Performance of models based on the surface patch and thick surface patch In order to evaluate the impact of interior residues, the surface patch-based prediction models and the thick surface patch-based models are built by combining conventional features

Summary

Introduction

Antigen-antibody interactions are key events in immune system, which provide important clues to the immune processes and responses. The classic way of predicting linear B-cell epitopes is based on amino acid propensities [5,6,7,8,9,10]. These commonly used propensities are hydrophilicity scale, flexibility scale, surface accessibility scale, exposed residue scale, beta-turn scale, antigenicity scale, polarity scale and so on. The machine learning-based models can well describe the nonlinear relationship between propensities and the location of linear epitopes, and lead to the improved performance. These linear epitope prediction methods cannot be used to predict conformational epitopes, which take majority of the epitopes

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based feature

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

An ensemble method for prediction of conformational B-cell epitopes from antigen sequences
Wei Zheng ... Jianzhao Gao
Computational Biology and Chemistry | VOL. 49
Wei Zheng, et. al.Wei Zheng ... Jianzhao Gao
18 Feb 2014
Computational Biology and Chemistry | VOL. 49

Computational prediction of conformational B-cell epitopes from antigen primary structures by ensemble learning.
Wen Zhang ... Juan Liu
PLoS ONE | VOL. 7
Wen Zhang, et. al.Wen Zhang ... Juan Liu
21 Aug 2012
PLoS ONE | VOL. 7

Pep-3D-Search: a method for B-cell epitope prediction based on mimotope analysis
Yan Xin Huang ... Yan Wang
BMC Bioinformatics | VOL. 9
Yan Xin Huang, et. al.Yan Xin Huang ... Yan Wang
01 Dec 2008
BMC Bioinformatics | VOL. 9

A novel conformational B-cell epitope prediction method based on mimotope and patch analysis
Pingping Sun ... Yuxin Li
Journal of Theoretical Biology | VOL. 394
Pingping Sun, et. al.Pingping Sun ... Yuxin Li
22 Jan 2016
Journal of Theoretical Biology | VOL. 394

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based feature

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics