Abstract

Endogenous retroviruses (ERVs) are a class of transposable elements found in all vertebrate genomes that contribute substantially to genomic functional and structural diversity. A host species acquires an ERV when an exogenous retrovirus infects a germ cell of an individual and becomes part of the genome inherited by viable progeny. ERVs that colonized ancestral lineages are fixed in contemporary species. However, in some extant species, ERV colonization is ongoing, which results in variation in ERV frequency in the population. To study the consequences of ERV colonization of a host genome, methods are needed to assign each ERV to a location in a species’ genome and determine which individuals have acquired each ERV by descent. Because well annotated reference genomes are not widely available for all species, de novo clustering approaches provide an alternative to reference mapping that are insensitive to differences between query and reference and that are amenable to mobile element studies in both model and non-model organisms. However, there is substantial uncertainty in both identifying ERV genomic position and assigning each unique ERV integration site to individuals in a population. We present an analysis suitable for detecting ERV integration sites in species without the need for a reference genome. Our approach is based on improved de novo clustering methods and statistical models that take the uncertainty of assignment into account and yield a probability matrix of shared ERV integration sites among individuals. We demonstrate that polymorphic integrations of a recently identified endogenous retrovirus in deer reflect contemporary relationships among individuals and populations.

Highlights

  • Transposable elements (TEs) comprise a significant proportion of all eukaryotic species’ genomes examined to date [1,2,3]

  • Instead of taking an arbitrary cutoff value to determine whether CrERV i is carried by animal j, we address the uncertainty of CrERV status for small counts by a two-component mixture model (Poisson and truncated Geometric)

  • We recently described a new member of the Gammaretrovirus family (CrERV) that is colonizing its vertebrate host and is transcriptionally active

Read more

Summary

Introduction

Transposable elements (TEs) comprise a significant proportion of all eukaryotic species’ genomes examined to date [1,2,3]. New acquisitions or mobilization of existing elements can result in variation in their number and genome location among species [4]. ERVs in particular can have profound impacts on host genome evolution and on host phenotype by altering host gene expression and by inducing structural variation in the host genome. Many ERVs no longer have the ability to replicate, they continue to contribute to host gene regulation [5,6,7,8]. Their presence in the genome provides evidence of colonization events at specific points along the ancestral lineage leading to a contemporary species [9]

Objectives
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.