TULIP: A transformer-based unsupervised language model for interacting peptides and T cell receptors that generalizes to unseen epitopes

Barthelemy Meynard-Piganeau,Barthelemy Meynard-Piganeau,Christoph Feinauer,Martin Weigt,Aleksandra M Walczak,Thierry Mora

doi:10.1073/pnas.2316401121

Barthelemy Meynard-Piganeau, Barthelemy Meynard-Piganeau + Show 4 more

https://doi.org/10.1073/pnas.2316401121

Copy DOI

Abstract

The accurate prediction of binding between T cell receptors (TCR) and their cognate epitopes is key to understanding the adaptive immune response and developing immunotherapies. Current methods face two significant limitations: the shortage of comprehensive high-quality data and the bias introduced by the selection of the negative training data commonly used in the supervised learning approaches. We propose a method, Transformer-based Unsupervised Language model for Interacting Peptides and T cell receptors (TULIP), that addresses both limitations by leveraging incomplete data and unsupervised learning and using the transformer architecture of language models. Our model is flexible and integrates all possible data sources, regardless of their quality or completeness. We demonstrate the existence of a bias introduced by the sampling procedure used in previous supervised approaches, emphasizing the need for an unsupervised approach. TULIP recognizes the specific TCRs binding an epitope, performing well on unseen epitopes. Our model outperforms state-of-the-art models and offers a promising direction for the development of more accurate TCR epitope recognition models.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

TULIP: A transformer-based unsupervised language model for interacting peptides and T cell receptors that generalizes to unseen epitopes

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America

Lead the way for us

Journal: Proceedings of the National Academy of Sciences of the United States of America	Publication Date: Jun 5, 2024
License type: CC BY-NC-ND 4.0

Similar Papers

Application of Transformer-Based Language Models to Detect Hate Speech in Social Media
Swapnanil Mukherjee ... Sujit Das
Journal of Computational and Cognitive Engineering | VOL. 2
Swapnanil Mukherjee, et. al.Swapnanil Mukherjee ... Sujit Das
17 Dec 2021
Journal of Computational and Cognitive Engineering | VOL. 2

Self-supervised learning of T cell receptor sequences exposes core properties for T cell membership.
Romi Goldner Kabeli ... Sol Efroni
Science Advances | VOL. 10
Romi Goldner Kabeli, et. al.Romi Goldner Kabeli ... Sol Efroni
26 Apr 2024
Science Advances | VOL. 10

Shapes of MHC Restriction
David N Garboczi ... William E Biddison
Immunity | VOL. 10
David N Garboczi, et. al.David N Garboczi ... William E Biddison
01 Jan 1998
Immunity | VOL. 10

The Generalization and Robustness of Transformer-Based Language Models on Commonsense Reasoning
Ke Shen
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence | VOL. 38
Ke ShenKe Shen
24 Mar 2024
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TULIP: A transformer-based unsupervised language model for interacting peptides and T cell receptors that generalizes to unseen epitopes

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America