ClusTCR: a python interface for rapid clustering of large sets of CDR3 sequences with unknown antigen specificity.

Sebastiaan Valkiers,Pieter Meysman,Kris Laukens,Max Van Houcke

doi:10.1093/bioinformatics/btab446

Sebastiaan Valkiers, Pieter Meysman + Show 2 more

Open Access

https://doi.org/10.1093/bioinformatics/btab446

Copy DOI

Journal: Bioinformatics	Publication Date: Jun 16, 2021
Citations: 41	License type: cc-by-nc

Affiliation: University of Antwerp

Abstract

The T-cell receptor (TCR) determines the specificity of a T-cell towards an epitope. As of yet, the rules for antigen recognition remain largely undetermined. Current methods for grouping TCRs according to their epitope specificity remain limited in performance and scalability. Multiple methodologies have been developed, but all of them fail to efficiently cluster large datasets exceeding 1 million sequences. To account for this limitation, we developed ClusTCR, a rapid TCR clustering alternative that efficiently scales up to millions of CDR3 amino acid sequences, without knowledge about their antigen specificity. Benchmarking comparisons revealed similar accuracy of ClusTCR as compared to other TCR clustering methods, as measured by cluster retention, purity and consistency. ClusTCR offers a drastic improvement in clustering speed, which allows the clustering of millions of TCR sequences in just a few minutes through ultraefficient similarity searching and sequence hashing. ClusTCR was written in Python 3. It is available as an anaconda package (https://anaconda.org/svalkiers/clustcr) and on github (https://github.com/svalkiers/clusTCR). Supplementary data are available at Bioinformatics online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ClusTCR: a python interface for rapid clustering of large sets of CDR3 sequences with unknown antigen specificity.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Similar Papers

Profiling of T Cell Repertoire in SARS-CoV-2-Infected COVID-19 Patients Between Mild Disease and Pneumonia.
Che-Mai Chang ... Tsung-Hsun Wu
Journal of Clinical Immunology | VOL. 41
Che-Mai Chang, et. al.Che-Mai Chang ... Tsung-Hsun Wu
05 May 2021
Journal of Clinical Immunology | VOL. 41

Decision letter: TCR meta-clonotypes for biomarker discovery with tcrdist3 enabled identification of public, HLA-restricted clusters of SARS-CoV-2 TCRs
Tahel Ronel ... Aleksandra M Walczak
-
Tahel Ronel, et. al.Tahel Ronel ... Aleksandra M Walczak
04 May 2021
04 May 2021

Editor's evaluation: TCR meta-clonotypes for biomarker discovery with tcrdist3 enabled identification of public, HLA-restricted clusters of SARS-CoV-2 TCRs
Benny Chain
-
Benny ChainBenny Chain
04 May 2021
04 May 2021

Investigation of Antigen-Specific T-Cell Receptor Clusters in Human Cancers.
Hongyi Zhang ... Xiaowei Zhan
Clinical Cancer Research | VOL. 26
Hongyi Zhang, et. al.Hongyi Zhang ... Xiaowei Zhan
13 Mar 2020
Clinical Cancer Research | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ClusTCR: a python interface for rapid clustering of large sets of CDR3 sequences with unknown antigen specificity.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics