Enhancing antigenic peptide discovery: Improved MHC-I binding prediction and methodology

Stanisław Giziński,Grzegorz Preibisch,Piotr Kucharski,Michał Tyrolski,Michał Rembalski,Piotr Grzegorczyk,Anna Gambin

doi:10.1016/j.ymeth.2024.01.016

Abstract

The Major Histocompatibility Complex (MHC) is a critical element of the vertebrate cellular immune system, responsible for presenting peptides derived from intracellular proteins. MHC-I presentation is pivotal in the immune response and holds considerable potential in the realms of vaccine development and cancer immunotherapy. This study delves into the limitations of current methods and benchmarks for MHC-I presentation. We introduce a novel benchmark designed to assess generalization properties and the reliability of models on unseen MHC molecules and peptides, with a focus on the Human Leukocyte Antigen (HLA)–a specific subset of MHC genes present in humans. Finally, we introduce HLABERT, a pretrained language model that outperforms previous methods significantly on our benchmark and establishes a new state-of-the-art on existing benchmarks.

Full Text