Abstract

Extensive diversity has been identified in the human heavy chain immunoglobulin locus, including allelic variation, gene duplication, and insertion/deletion events. Several genes have been suggested to be deleted in many haplotypes. Such findings have commonly been based on inference of the germline repertoire from data sets covering antibody heavy chain encoding transcripts. The inference process operates under conditions that may limit identification of genes transcribed at low levels. The presence of rare transcripts that would indicate the existence of poorly expressed alleles in haplotypes that otherwise appear to have deleted these genes has been assessed in the present study. Alleles IGHV1-2*05, IGHV1-3*02, IGHV4-4*01, and IGHV7-4-1*01 were all identified as being expressed from multiple haplotypes, but only at low levels, haplotypes that by inference often appeared not to express these genes at all. These genes are thus not as commonly deleted as previously thought. An assessment of the 5’ untranslated region (up to and including the TATA-box), the signal peptide-encoding part of the gene, and the 3’-heptamer suggests that the alleles have no or minimal sequence difference in these regions in comparison to highly expressed alleles. This suggest that they may be able to participate in immunoglobulin gene rearrangement, transcription and translation. However, all four poorly expressed alleles harbor unusual sequence variants within their coding region that may compromise the functionality of the encoded products, thereby limiting their incorporation into the immunoglobulin repertoire. Transcripts based on IGHV7-4-1*01 that had undergone somatic hypermutation and class switch had mutated the codon that encoded the unusual residue in framework region 3 (cysteine 92; located far from the antigen binding site). This finding further supports the poor compatibility of this unusual residue in a fully functional protein product. Indications of a linkage disequilibrium were identified as IGHV1-2*05 and IGHV4-4*01 co-localized to the same haplotypes. Furthermore, transcripts of two of the poorly expressed alleles (IGHV1-3*02 and IGHV4-4*01) mostly do not encode in-frame, functional products, suggesting that these alleles might be essentially non-functional. It is proposed that the functionality status of immunoglobulin genes should also include assessment of their ability to encode functional protein products.

Highlights

  • IntroductionThe specificity-defining variable domains of human immunoglobulin heavy chains are encoded by genes [IGHV (immunoglobulin heavy chain variable), IGHD (immunoglobulin heavy chain diversity), and IGHJ (immunoglobulin heavy chain joining) genes] located on chromosome 14

  • The specificity-defining variable domains of human immunoglobulin heavy chains are encoded by genes [IGHV, IGHD, and IGHJ genes] located on chromosome 14

  • Immunoglobulin germline genes are described in the IMGT database [4], a source of information that is used by a range of bioinformatics tools to define the genes and the downstream hypermutation processes involved in establishment and evolution of particular antibodies

Read more

Summary

Introduction

The specificity-defining variable domains of human immunoglobulin heavy chains are encoded by genes [IGHV (immunoglobulin heavy chain variable), IGHD (immunoglobulin heavy chain diversity), and IGHJ (immunoglobulin heavy chain joining) genes] located on chromosome 14. Immunoglobulin germline genes are described in the IMGT database [4], a source of information that is used by a range of bioinformatics tools to define the genes and the downstream hypermutation processes involved in establishment and evolution of particular antibodies Such processes require precise definition of the germline IGHV, IGHD, and IGHJ genes that have been used to generate the genes that encode antibodies of interest. Germline gene inference based on the information content of generation sequencing (NGS) data sets has, emerged as an important approach to define personalized germline gene allele repertoires available to individuals to generate their antibody responses [7,8,9,10,11] These approaches allow for better gene/allele assignment and tracking of hypermutation pathways that have resulted in antibody sequences of the subject under investigation. It has been possible to identify allele usage bias and mosaic patterns of deleted genes in a large set of subjects [16], as well as functional deletions of large parts of the IGHV locus [15, 16]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call