Deep generative models for T cell receptor protein sequences.

Kristian Davidsen,Philip Bradley,Frederick A Matsen,William S Dewitt,Jean Feng,Branden J Olson,Elias Harkins

doi:10.7554/elife.46935

Kristian Davidsen, Philip Bradley + Show 5 more

Open Access

https://doi.org/10.7554/elife.46935

Copy DOI

Abstract

Probabilistic models of adaptive immune repertoire sequence distributions can be used to infer the expansion of immune cells in response to stimulus, differentiate genetic from environmental factors that determine repertoire sharing, and evaluate the suitability of various target immune sequences for stimulation via vaccination. Classically, these models are defined in terms of a probabilistic V(D)J recombination model which is sometimes combined with a selection model. In this paper we take a different approach, fitting variational autoencoder (VAE) models parameterized by deep neural networks to T cell receptor (TCR) repertoires. We show that simple VAE models can perform accurate cohort frequency estimation, learn the rules of VDJ recombination, and generalize well to unseen sequences. Further, we demonstrate that VAE-like models can distinguish between real sequences and sequences generated according to a recombination-selection model, and that many characteristics of VAE-generated sequences are similar to those of real sequences.

Highlights

T cell receptors (TCRs) are composed of an a and a b protein chain, both originating from a random V(D)J recombination process, followed by selective steps that ensure functionality and limit autoreactivity
We model TCR sequences using simple variants of variational autoencoders (VAEs)
VAE models can be described as consisting of an n-dimensional latent space, a prior pðzÞ on that latent space, and probabilistic maps parameterized by two neural networks: an encoder qfðzjxÞ and a decoder pðx^jzÞ (Figure 1; Kingma et al, 2014b)

Summary

Introduction

T cell receptors (TCRs) are composed of an a and a b protein chain, both originating from a random V(D)J recombination process, followed by selective steps that ensure functionality and limit autoreactivity. To generate diverse and functional TCRs, T cells combine a stochastic process for choosing from a pool of V, D and J genes with a process for selecting for expression and MHC recognition. The resulting ensemble of protein sequences summarizes each individual’s previous immune exposures and largely determines their resistance to various infections. One can consider these protein sequences as a sample from a probability distribution, whether it is the distribution of receptors within an individual, or the distribution of receptors in a population. This article concerns fitting such probability distributions on TCR b protein sequences (which will be called ‘TCR sequences’ for the rest of the paper)

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: eLife	Publication Date: Sep 5, 2019
Citations: 69	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Deep generative models for T cell receptor protein sequences.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: eLife

Lead the way for us

Similar Papers

Author response: Deep generative models for T cell receptor protein sequences
Kristian Davidsen ... Jean Feng
-
Kristian Davidsen, et. al.Kristian Davidsen ... Jean Feng
05 Aug 2019
05 Aug 2019

Application of Variational AutoEncoder (VAE) Model and Image Processing Approaches in Game Design
Hugo Wai Leung Mak ... Runze Han
Sensors | VOL. 23
Hugo Wai Leung Mak, et. al.Hugo Wai Leung Mak ... Runze Han
25 Mar 2023
Sensors | VOL. 23

Dynamics of B cell repertoires and emergence of cross-reactive responses in patients with different severities of COVID-19.
Zachary Montague ... Giulio Isacchini
Cell Reports | VOL. 35
Zachary Montague, et. al.Zachary Montague ... Giulio Isacchini
01 May 2021
Cell Reports | VOL. 35

Normalized Synergy Predicts That CD8 Co-Receptor Contribution to T Cell Receptor (TCR) and pMHC Binding Decreases As TCR Affinity Increases in Human Viral-Specific T Cells.
Chad M Williams ... Alexandra A Schonnesen
Frontiers in immunology | VOL. 8
Chad M Williams, et. al.Chad M Williams ... Alexandra A Schonnesen
28 Jul 2017
Frontiers in immunology | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep generative models for T cell receptor protein sequences.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: eLife