Abstract

AbstractThe cDNA sequences of immunoglobulins (IG) and T cell receptors (TR) represent more than one half of the sequences in the IMGT^®^ nucleotide database IMGT/LIGM-DB^1^ and 75% of them are from human and mouse. A few cDNA are germline but the great majority results from a V-D-J or V-J gene rearrangement, spliced to a C gene. The IG and TR genes have been studied extensively in IMGT^®^ ("http://www.imgt.org":http://www.imgt.org) ^2^, which allowed to set up their nomenclature and the corresponding germline reference sequences. These standardized reference directory sets (one for each group of each locus) and the IMGT-ONTOLOGY axioms and derived concepts^3^ are the key elements indispensable to perform the annotation of IG and TR cDNA sequences. A Java program, IMGT/Automat^4^, was developed by IMGT^®^, to automatically annotate the IG and TR cDNA sequences and to produce a totally automatic and complete annotation. More than 9,000 human and mouse cDNA have already been successfully automatically annotated. The quality of the cDNA automatic annotation is equivalent to the quality of the annotation achieved by a human expert. The IMGT^®^ strategy is currently the only way, in the field of immunogenetics, to guarantee the annotation quality and the management of an always increasing number of IG and TR cDNA nucleotide sequences.

Highlights

  • IMGT®, the international ImMunoGeneTics information system®, Laboratoire d'ImmunoGénétique Moléculaire LIGM, Université Montpellier 2, Institut de Génétique Humaine IGH, UPR CNRS 1142, 141 rue de la Cardonille, F-34396 Montpellier cedex 05, France

  • The cDNA sequences of immunoglobulins (IG) and T cell receptors (TR) represent more than one half of the sequences in the IMGT® nucleotide database IMGT/LIGM-DB [1] and 75% of them are from human and mouse

  • The description of the V-D-J and V-J junction is performed by the IMGT/JunctionAnalysis [6] tool

Read more

Summary

Im Muno Gene Tics

The cDNA sequences of immunoglobulins (IG) and T cell receptors (TR) represent more than one half of the sequences in the IMGT® nucleotide database IMGT/LIGM-DB [1] and 75% of them are from human and mouse. The IG and TR genes have been studied extensively in IMGT® (http://www.imgt.org) [2], which allowed to set up their nomenclature and the corresponding germline reference sequences. These standardized reference directory sets (one for each group of each locus) and the IMGT-ONTOLOGY axioms and derived concepts [3] are the key elements indispensable to perform the annotation of IG and TR cDNA sequences.

Initiation codon found
Functionality determination
Codon stop terminal found
Composed coding regions
IDENTIFICATION Keywords
FT FT FT
IMGT annotatedFT at le
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call