Abstract

Given thousands of proteins constituting a eukaryotic pathogen, the principal objective for a high-throughput in silico vaccine discovery pipeline is to select those proteins worthy of laboratory validation. Accurate prediction of T-cell epitopes on protein antigens is one crucial piece of evidence that would aid in this selection. Prediction of peptides recognised by T-cell receptors have to date proved to be of insufficient accuracy. The in silico approach is consequently reliant on an indirect method, which involves the prediction of peptides binding to major histocompatibility complex (MHC) molecules. There is no guarantee nevertheless that predicted peptide-MHC complexes will be presented by antigen-presenting cells and/or recognised by cognate T-cell receptors. The aim of this study was to determine if predicted peptide-MHC binding scores could provide contributing evidence to establish a protein’s potential as a vaccine. Using T-Cell MHC class I binding prediction tools provided by the Immune Epitope Database and Analysis Resource, peptide binding affinity to 76 common MHC I alleles were predicted for 160 Toxoplasma gondii proteins: 75 taken from published studies represented proteins known or expected to induce T-cell immune responses and 85 considered less likely vaccine candidates. The results show there is no universal set of rules that can be applied directly to binding scores to distinguish a vaccine from a non-vaccine candidate. We present, however, two proposed strategies exploiting binding scores that provide supporting evidence that a protein is likely to induce a T-cell immune response–one using random forest (a machine learning algorithm) with a 72% sensitivity and 82.4% specificity and the other, using amino acid conservation scores with a 74.6% sensitivity and 70.5% specificity when applied to the 160 benchmark proteins. More importantly, the binding score strategies are valuable evidence contributors to the overall in silico vaccine discovery pool of evidence.

Highlights

  • An in silico protein-based vaccine discovery pipeline for eukaryotic pathogens, inspired by reverse vaccinology [1,2,3,4,5,6], encapsulates a collection of various bioinformatics prediction tools [7]

  • Some of the rules showed promise but failed when applied to different datasets. These results suggest that both vaccine and non-vaccine candidates contain high-affinity binding peptides, peptides that bind to the same major histocompatibility complex (MHC) allele, have similar numbers of binding peptides and promiscuous peptides per protein, and have similar numbers of peptides that bind to promiscuous MHCs

  • The aim of this study was to determine whether predicted peptide-MHC I binding scores for thousands of proteins from a target pathogen could contribute evidence to the in silico discovery of vaccine candidates

Read more

Summary

Introduction

An in silico protein-based vaccine discovery pipeline for eukaryotic pathogens, inspired by reverse vaccinology [1,2,3,4,5,6], encapsulates a collection of various bioinformatics prediction tools [7]. The aim of these tools is to gather computational evidence, derived mainly from protein sequences, to select the most promising vaccine candidates worthy of laboratory validation [8]. There are two computational approaches to T-cell epitope prediction based on direct and indirect methods. This paper focuses on the indirect method and the MHC class I molecule

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call