Abstract

Mice have emerged as one of the most popular and valuable model organisms in the research of human biology. This is due to their genetic and physiological similarity to humans, short generation times, availability of genetically homologous inbred strains, and relatively easy laboratory maintenance. Therefore, following the release of the initial human reference genome, the generation of the mouse reference genome was prioritised and represented an important scientific resource for the mouse genetics community. In 2002, the Mouse Genome Sequencing Consortium published an initial draft of the mouse reference genome which contained ~ 96% of the euchromatic genome of female C57BL/6 J mice. Almost two decades on from the publication of the initial draft, sequencing efforts have continued to increase the completeness and accuracy of the C57BL/6 J reference genome alongside advances in genome annotation. Additionally new sequencing technologies have provided a wealth of data that has added to the repertoire of annotations associated with traditional genomic annotations. Including but not limited to advances in regulatory elements, the 3D genome and individual cellular states. In this review we focus on the reference genome C57BL/6 J and summarise the different aspects of genomic and cellular annotations, as well as their relevance to mouse genetic research. We denote a genomic annotation as a functional unit of the genome. Cellular annotations are annotations of cell type or state, defined by the transcriptomic expression profile of a cell. Due to the wide-ranging number and diversity of annotations describing the mouse genome, we focus on gene, repeat and regulatory element annotation as well as two relatively new technologies; 3D genome architecture and single-cell sequencing outlining their utility in genetic research and their current challenges.

Highlights

  • Good annotations are reliant on the accuracy and high quality of the reference genome assembly

  • The Genome Reference Consortium is responsible for building, improving and providing the mouse genome assembly to the scientific community

  • The GENCODE resource is based at the European Bioinformatics Institute (EBI-EBML)

Read more

Summary

Established annotations in the mouse reference genome

Good annotations are reliant on the accuracy and high quality of the reference genome assembly. Regulatory elements tightly control the spatio-temporal expression of each gene, giving rise to an abundance of different cell types They offer a critical layer of information in understanding how the same set of gene annotations, which exist in almost every cell, can give rise to complex multicellular organisms like mice and humans. One of which is the functional annotation of the mammalian genome project (FANTOM5) which generated Cap Analysis of Gene Expression (CAGE) seq in mouse and human cells (Lizio et al 2015, 2019) This allowed the precise annotation of known promoter locations and the identification of new promoters, as well as annotation of promoter activity in different cell types (measured by expression level of CAGE peaks). One of the most popular is the Encyclopedia of DNA Elements project (ENCODE) (Davis et al 2018), which has coordinated an effort to generate datasets to analyse primary order and higher order chromatin architecture, transcription, DNA methylation, histone modification, transcription factor occupancy and

Chromatin loop
Findings
Whole Mouse Brain
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call