Abstract
BackgroundVertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Next generation sequencing methods provide unique approaches to a number of immuno-based research areas including antibody discovery and engineering, disease surveillance, and host immune response to vaccines. In particular, single-molecule circular consensus sequencing permits the sequencing of antibody repertoires at previously unattainable depths of coverage and accuracy. We approached the bovine immunoglobulin G (IgG) repertoire with the objective of characterizing diversity of expressed IgG transcripts. Here we present single-molecule real-time sequencing data of expressed IgG heavy-chain repertoires of four individual cattle. We describe the diversity observed within antigen binding regions and visualize this diversity using a network-based approach.ResultsWe generated 49,945 high quality cDNA sequences, each spanning the entire IgG variable region from four Bos taurus calves. From these sequences we identified 49,521 antigen binding regions using the automated Paratome web server. Approximately 9% of all unique complementarity determining 2 (CDR2) sequences were of variable lengths. A bimodal distribution of unique CDR3 sequence lengths was observed, with common lengths of 5–6 and 21–25 amino acids. The average number of cysteine residues in CDR3s increased with CDR3 length and we observed that cysteine residues were centrally located in CDR3s. We identified 19 extremely long CDR3 sequences (up to 62 amino acids in length) within IgG transcripts. Network analyses revealed distinct patterns among the expressed IgG antigen binding repertoires of the examined individuals.ConclusionsWe utilized circular consensus sequencing technology to provide baseline data of the expressed bovine IgG repertoire that can be used for future studies important to livestock research. Somatic mutation resulting in base insertions and deletions in CDR2 further diversifies the bovine antibody repertoire. In contrast to previous studies, our data indicate that unusually long CDR3 sequences are not unique to IgM antibodies in cattle. Centrally located cysteine residues in bovine CDR3s provide further evidence that disulfide bond formation is likely of structural importance. We hypothesize that network or cluster-based analyses of expressed antibody repertoires from controlled challenge experiments will help identify novel natural antigen binding solutions to specific pathogens of interest.
Highlights
Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens
To facilitate primer design targeting the variable region of the heavy chain of immunoglobulin G (IgG) mRNA, we developed a database of bovine EST sequences based on BLAST searches of the bovine Heavy-chain variable domain (VH) region (GenBank accession numbers U55164–U55169, U55171, U55172, U55174, U55175 [10,23]) and constant regions of IgG1, IgG2, and IgG3 (GenBank accession numbers S82409, S82407, and BTU63638; [24,25])
Filtering based on quality score values reduced the set to 70,610 sequences, alignment of this reduced set to bovine consensus FR1 resulted in 68,169 sequences, elimination of sequences with stop codons and confirmation of residues encoding IgG C1 in the predicted reading frame resulted in 49,945 (12% of starting sequences) of the entire IgG VH region available for analysis
Summary
Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Alternative combinations of V, D, and J segments and junctional diversity, coupled with somatic hypermutation, generate a surprising number of potential antibody sequences with at least 1 × 107 unique antibody binding sites estimated for humans and Mus [7]. This diversity is in part due to the functional VH genes present in the germlines of these species, with humans having approximately 44 V, 27 D, and 6 J segments and Mus approximately 219 V, 21 D, and 4 J segments [8,9]. This CDR3 length variability is likely associated with the limited number of functional germline V, D, and J segments within the B. taurus genome, perhaps serving to further diversify the bovine immune response [13,14,17]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.