Abstract

Several decades of research have convincingly shown that classical human leukocyte antigen (HLA) loci bear signatures of natural selection. Despite this conclusion, many questions remain regarding the type of selective regime acting on these loci, the time frame at which selection acts, and the functional connections between genetic variability and natural selection. In this review, we argue that genomic datasets, in particular those generated by next-generation sequencing (NGS) at the population scale, are transforming our understanding of HLA evolution. We show that genomewide data can be used to perform robust and powerful tests for selection, capable of identifying both positive and balancing selection at HLA genes. Importantly, these tests have shown that natural selection can be identified at both recent and ancient timescales. We discuss how findings from genomewide association studies impact the evolutionary study of HLA genes, and how genomic data can be used to survey adaptive change involving interaction at multiple loci. We discuss the methodological developments which are necessary to correctly interpret genomic analyses involving the HLA region. These developments include adapting the NGS analysis framework so as to deal with the highly polymorphic HLA data, as well as developing tools and theory to search for signatures of selection, quantify differentiation, and measure admixture within the HLA region. Finally, we show that high throughput analysis of molecular phenotypes for HLA genes—namely transcription levels—is now a feasible approach and can add another dimension to the study of genetic variation.

Highlights

  • The availability of genomic data at the scale of populations is transforming our understanding of the processes shaping human genetic variation

  • We found that frequency estimates for human leukocyte antigen (HLA) SNPs are relatively robust, the SNP genotype calls within the HLA loci have alarmingly high error rates (18.6% of calls are incorrect) and are biased toward over-representing the alleles present in the reference genome

  • Leveraged by very large samples of sequence data, they identified genomic regions where selection has driven advantageous alleles to high frequencies in a time frame as recent as 2 000 years, and found that at least three independent SNPs within the extended MHC region were among the most significant targets (Field et al 2016). This test is designed to detect recent positive selection, implying that balancing selection should not be seen as the only regime relevant to HLA evolution

Read more

Summary

Present address

Keywords HLA (human leukocyte antigen) · MHC (major histocompatibility complex) · Evolution · Genomics · Balancing selection

Introduction
Findings
Method

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.