Abstract

BackgroundT cells and B cells are essential in the adaptive immunity via expressing T cell receptors and immunoglogulins respectively for recognizing antigens. To recognize a wide variety of antigens, a highly diverse repertoire of receptors is generated via complex recombination of the receptor genes. Reasonably, frequencies of the recombination events have been shown to predict immune diseases and provide insights into the development of immunity. The field is further boosted by high-throughput sequencing and several computational tools have been released to analyze the recombined sequences. However, all current tools assume regular recombination of the receptor genes, which is not always valid in data prepared using a RACE approach. Compared to the traditional multiplex PCR approach, RACE is free of primer bias, therefore can provide accurate estimation of recombination frequencies. To handle the non-regular recombination events, a new computational program is needed.ResultsWe propose TRIg to handle non-regular T cell receptor and immunoglobulin sequences. Unlike all current programs, TRIg does alignments to the whole receptor gene instead of only to the coding regions. This brings new computational challenges, e.g., ambiguous alignments due to multiple hits to repetitive regions. To reduce ambiguity, TRIg applies a heuristic strategy and incorporates gene annotation to identify authentic alignments. On our own and public RACE datasets, TRIg correctly identified non-regularly recombined sequences, which could not be achieved by current programs. TRIg also works well for regularly recombined sequences.ConclusionsTRIg takes into account non-regular recombination of T cell receptor and immunoglobulin genes, therefore is suitable for analyzing RACE data. Such analysis will provide accurate estimation of recombination events, which will benefit various immune studies directly. In addition, TRIg is suitable for studying aberrant recombination in immune diseases. TRIg is freely available at https://github.com/TLlab/trig.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-1304-2) contains supplementary material, which is available to authorized users.

Highlights

  • T cells and B cells are essential in the adaptive immunity via expressing T cell receptors and immunoglogulins respectively for recognizing antigens

  • After the introduction of nextgeneration sequencing (NGS), which generates a large amount of data, new tools for analyzing T-cell receptor (TR) and Ig sequences are all geared toward faster speed

  • TRIg was compared to Decombinator, IgBLAST, and IMGT/HighV-Quest using our own Rapid amplification of cDNA ends (RACE) data of a healthy individual and two public data (Table 1)

Read more

Summary

Introduction

T cells and B cells are essential in the adaptive immunity via expressing T cell receptors and immunoglogulins respectively for recognizing antigens. After the introduction of nextgeneration sequencing (NGS), which generates a large amount of data, new tools for analyzing TR and Ig sequences are all geared toward faster speed. These include IMGT/HighV-QUEST [6], Decombinator [7], and the recent IgBLAST [8] and MiTCR [9]. Despite their distinct algorithms, all these tools do alignment only to the V(D)J regions instead of the whole gene to enhance speed. Software for subsequent analysis of diversity and clonality, e.g., tcR [10] and IMEX [11], are available

Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.