Language model-based B cell receptor sequence embeddings can effectively encode receptor specificity.

Meng Wang,Yuval Kluger,Henry Li,Jonathan Patsenker,Steven H Kleinstein

doi:10.1093/nar/gkad1128

Meng Wang, Yuval Kluger + Show 3 more

Open Access

https://doi.org/10.1093/nar/gkad1128

Copy DOI

Abstract

High throughput sequencing of B cell receptors (BCRs) is increasingly applied to study the immense diversity of antibodies. Learning biologically meaningful embeddings of BCR sequences is beneficial for predictive modeling. Several embedding methods have been developed for BCRs, but no direct performance benchmarking exists. Moreover, the impact of the input sequence length and paired-chain information on the prediction remains to be explored. We evaluated the performance of multiple embedding models to predict BCR sequence properties and receptor specificity. Despite the differences in model architectures, most embeddings effectively capture BCR sequence properties and specificity. BCR-specific embeddings slightly outperform general protein language models in predicting specificity. In addition, incorporating full-length heavy chains and paired light chain sequences improves the prediction performance of all embeddings. This study provides insights into the properties of BCR embeddings to improve downstream prediction applications for antibody analysis and discovery.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nucleic acids research	Publication Date: Dec 18, 2023
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Language model-based B cell receptor sequence embeddings can effectively encode receptor specificity.

Abstract

Talk to us

Similar Papers

More From: Nucleic acids research

Lead the way for us

Similar Papers

Intrinsic Properties of immunoglobulin IgG1 Isotype-Switched B Cell Receptors Promote Microclustering and the Initiation of Signaling
Wanli Liu ... Susan K Pierce
Immunity | VOL. 32
Wanli Liu, et. al.Wanli Liu ... Susan K Pierce
01 Jun 2010
Immunity | VOL. 32

Dynamics of B cell repertoires and emergence of cross-reactive responses in patients with different severities of COVID-19.
Zachary Montague ... Giulio Isacchini
Cell Reports | VOL. 35
Zachary Montague, et. al.Zachary Montague ... Giulio Isacchini
01 May 2021
Cell Reports | VOL. 35

Decision letter: Characterisation of the immune repertoire of a humanised transgenic mouse through immunophenotyping and high-throughput sequencing
Satyajit Rath
-
Satyajit RathSatyajit Rath
19 Oct 2022
19 Oct 2022

Author response: Characterisation of the immune repertoire of a humanised transgenic mouse through immunophenotyping and high-throughput sequencing
Eve Richardson ... Aleksandr Kovaltsuk
-
Eve Richardson, et. al.Eve Richardson ... Aleksandr Kovaltsuk
19 Jan 2023
19 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Language model-based B cell receptor sequence embeddings can effectively encode receptor specificity.

Abstract

Talk to us

Similar Papers

More From: Nucleic acids research