Mammalian endogenous retroviruses (ERVs) are ancient retroviruses that have integrated into host genomes. ERVs were believed to be inactive until the discovery of ERV transcription in the mouse genome. However, the transcription level and function of ERV elements in mammalian genomes are not well understood. In this study, we performed the first genome-wide scan of ERV loci in the American mink (Neogale vison) genome (NeoERV), followed by transcriptomic analysis to detect actively transcribed NeoERV elements. A total of 365,791 NeoERV loci were identified, and 161,205 (44%) of these loci were found to be actively transcribed based on transcriptomic data from three tissue types (amygdala, trachea and lung). More than one third of the actively transcribed NeoERV loci were tissue-specific. Furthermore, some of the active loci were associated with host gene transcription, and the level of NeoERV transcription was positively correlated with that of host genes, specifically when active loci were located in overlapped gene regions. An in-depth analysis of the envelope protein-coding env gene showed that, in general, its transcription level was higher than that of NeoERVs, which is believed to be associated with host immunity.