On Evaluating MHC-II Binding Peptide Prediction Methods

Yasser El-Manzalawy,Drena Dobbs,Vasant Honavar

doi:10.1371/journal.pone.0003268

Yasser El-Manzalawy, Drena Dobbs + Show 1 more

Open Access

https://doi.org/10.1371/journal.pone.0003268

Copy DOI

Journal: PLoS ONE	Publication Date: Sep 24, 2008
Citations: 90	License type: CC BY 4.0

Affiliation: Iowa State University

Abstract

Choice of one method over another for MHC-II binding peptide prediction is typically based on published reports of their estimated performance on standard benchmark datasets. We show that several standard benchmark datasets of unique peptides used in such studies contain a substantial number of peptides that share a high degree of sequence identity with one or more other peptide sequences in the same dataset. Thus, in a standard cross-validation setup, the test set and the training set are likely to contain sequences that share a high degree of sequence identity with each other, leading to overly optimistic estimates of performance. Hence, to more rigorously assess the relative performance of different prediction methods, we explore the use of similarity-reduced datasets. We introduce three similarity-reduced MHC-II benchmark datasets derived from MHCPEP, MHCBN, and IEDB databases. The results of our comparison of the performance of three MHC-II binding peptide prediction methods estimated using datasets of unique peptides with that obtained using their similarity-reduced counterparts shows that the former can be rather optimistic relative to the performance of the same methods on similarity-reduced counterparts of the same datasets. Furthermore, our results demonstrate that conclusions regarding the superiority of one method over another drawn on the basis of performance estimates obtained using commonly used datasets of unique peptides are often contradicted by the observed performance of the methods on the similarity-reduced versions of the same datasets. These results underscore the importance of using similarity-reduced datasets in rigorously comparing the performance of alternative MHC-II peptide prediction methods.

Highlights

T-cells epitopes are short linear peptides generated by cleavage of antigenic proteins
MHCPEP, MHCBN, and Immune Epitope Database and Analysis Resource (IEDB) databases have a large number of highly similar peptides: the number of peptides in the similarityreduced versions in the three benchmark datasets is
Prediction methods evaluated on similarity-reduced datasets is substantially worse than that estimated using the datasets of unique peptides. This finding is especially significant in light of the fact that MHCPEP and MHCBN datasets have been used for comparing alternative major histocompatibility complex (MHC)-II peptide prediction methods in most of the published studies [5,6,15,16,17,18,19,25]

Summary

Introduction

T-cells epitopes are short linear peptides generated by cleavage of antigenic proteins. The identification of T-cell epitopes in protein sequences is important for understanding disease pathogenesis, identifying potential autoantigens, and designing vaccines and immune-based cancer therapies. A major step in identifying potential T-cell epitopes involves identifying the peptides that bind to a target major histocompatibility complex (MHC) molecule. The binding groove of MHC-II molecules is open at both ends, allowing peptides longer than 9-mers to bind. It has been reported that a 9-mer core region is essential for MHC-II binding [2,3]. Because the precise location of the 9mer core region of MHC-II binding peptides is unknown, predicting MHC-II binding peptides tends to be more challenging than predicting MHC-I binding peptides

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On Evaluating MHC-II Binding Peptide Prediction Methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

Bird ticks in Hungary reflect western, southern, eastern flyway connections and two genetic lineages of Ixodes frontalis and Haemaphysalis concinna.
S Hornok ... T Csörgő
Parasites & Vectors | VOL. 9
S Hornok, et. al.S Hornok ... T Csörgő
24 Feb 2016
Parasites & Vectors | VOL. 9

Cloning of human cDNAs encoding mitochondrial and cytosolic serine hydroxymethyltransferases and chromosomal localization
T.A Garrow ... B Shane
Journal of Biological Chemistry | VOL. 268
T.A Garrow, et. al.T.A Garrow ... B Shane
01 Jun 1993
Journal of Biological Chemistry | VOL. 268

Glycerol transport and phosphoenolpyruvate-dependent enzyme I- and HPr-catalysed phosphorylation of glycerol kinase in Thermus flavus.
Emmanuelle Darbon ... Josef Deutscher
Microbiology | VOL. 145 ( Pt 11)
Emmanuelle Darbon, et. al.Emmanuelle Darbon ... Josef Deutscher
01 Nov 1999
Microbiology | VOL. 145 ( Pt 11)

The amino acid sequences, structure comparisons and inhibition kinetics of sheep cathepsin L and sheep stefin B
Anka Ritonja ... Clive Dennison
Comparative Biochemistry and Physiology, Part B | VOL. 114
Anka Ritonja, et. al.Anka Ritonja ... Clive Dennison
01 Jun 1996
Comparative Biochemistry and Physiology, Part B | VOL. 114

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On Evaluating MHC-II Binding Peptide Prediction Methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE