Abstract

BackgroundAlthough the reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. One such region, 1q21.1-q21.2, is of increasing interest due to its relevance to human disease and evolution. Elucidation of the exact variants behind these associations has been hampered by the repetitive nature of the region and its incomplete assembly. This region also contains 238 of the 270 human DUF1220 protein domains, which are implicated in human brain evolution and neurodevelopment. Additionally, examinations of this protein domain have been challenging due to the incomplete 1q21 build. To address these problems, a single-haplotype hydatidiform mole BAC library (CHORI-17) was used to produce the first complete sequence of the 1q21.1-q21.2 region.ResultsWe found and addressed several inaccuracies in the GRCh37sequence of the 1q21 region on large and small scales, including genomic rearrangements and inversions, and incorrect gene copy number estimates and assemblies. The DUF1220-encoding NBPF genes required the most corrections, with 3 genes removed, 2 genes reassigned to the 1p11.2 region, 8 genes requiring assembly corrections for DUF1220 domains (~91 DUF1220 domains were misassigned), and multiple instances of nucleotide changes that reassigned the domain to a different DUF1220 subtype. These corrections resulted in an overall increase in DUF1220 copy number, yielding a haploid total of 289 copies. Approximately 20 of these new DUF1220 copies were the result of a segmental duplication from 1q21.2 to 1p11.2 that included two NBPF genes. Interestingly, this duplication may have been the catalyst for the evolutionarily important human lineage-specific chromosome 1 pericentric inversion.ConclusionsThrough the hydatidiform mole genome sequencing effort, the 1q21.1-q21.2 region is complete and misassemblies involving inter- and intra-region duplications have been resolved. The availability of this single haploid sequence path will aid in the investigation of many genetic diseases linked to 1q21, including several associated with DUF1220 copy number variations. Finally, the corrected sequence identified a recent segmental duplication that added 20 additional DUF1220 copies to the human genome, and may have facilitated the chromosome 1 pericentric inversion that is among the most notable human-specific genomic landmarks.

Highlights

  • The reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture

  • While confirmation testing conducted on the assembly indicates that the CHM1 library is representative of the general population, it cannot be ruled out that some of the changes may be true variants within the human population

  • This investigation demonstrated that resources developed from a haploid hydatidiform mole genome could effectively be used to complete assembly of chromosome 1q21, one of the most complex and evolutionarily dynamic regions of the human genome

Read more

Summary

Introduction

The reference human genome sequence was declared finished in 2003, some regions of the genome remain incomplete due to their complex architecture. Elucidation of the exact variants behind these associations has been hampered by the repetitive nature of the region and its incomplete assembly This region contains 238 of the 270 human DUF1220 protein domains, which are implicated in human brain evolution and neurodevelopment. Completion of an accurate assembly is a continuing challenge given the presence of multiple highly duplicated and complex regions that remain largely intractable to analysis with commonly used assembly techniques [1]. Finishing these regions has significant implications for identifying causative disease loci and in turn efficacious treatments for patients with genetic and genomic disorders [2]. The 2009 human genome assembly reflects this challenge, containing 14 gaps in the 7.7 Mb 1q21.1-2 region

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call