Abstract

BackgroundDivergence in gene structure following gene duplication is not well understood. Gene duplication can occur via whole-genome duplication (WGD) and single-gene duplications including tandem, proximal and transposed duplications. Different modes of gene duplication may be associated with different types, levels, and patterns of structural divergence.ResultsIn Arabidopsis thaliana, we denote levels of structural divergence between duplicated genes by differences in coding-region lengths and average exon lengths, and the number of insertions/deletions (indels) and maximum indel length in their protein sequence alignment. Among recent duplicates of different modes, transposed duplicates diverge most dramatically in gene structure. In transposed duplications, parental loci tend to have longer coding-regions and exons, and smaller numbers of indels and maximum indel lengths than transposed loci, reflecting biased structural changes in transposed duplications. Structural divergence increases with evolutionary time for WGDs, but not transposed duplications, possibly because of biased gene losses following transposed duplications. Structural divergence has heterogeneous relationships with nucleotide substitution rates, but is consistently positively correlated with gene expression divergence. The NBS-LRR gene family shows higher-than-average levels of structural divergence.ConclusionsOur study suggests that structural divergence between duplicated genes is greatly affected by the mechanisms of gene duplication and may be not proportional to evolutionary time, and that certain gene families are under selection on rapid evolution of gene structure.

Highlights

  • IntroductionGene duplication can occur via whole-genome duplication (WGD) and single-gene duplications including tandem, proximal and transposed duplications

  • Divergence in gene structure following gene duplication is not well understood

  • We found that parental loci generally have longer coding-regions and exons, and fewer indels with smaller maximum indel lengths than transposed loci (Figure 3), suggesting that transposed duplications tend to be associated with biased changes in gene structure

Read more

Summary

Introduction

Gene duplication can occur via whole-genome duplication (WGD) and single-gene duplications including tandem, proximal and transposed duplications. Gene duplication may occur by different modes such as whole-genome duplication (WGD) [2] and single-gene duplications [3,4,5]. Single-gene duplications including local (tandem or proximal) and dispersed duplications contribute to the origin of a substantial portion of Arabidopsis genes [5,7,8]. More recent models for gene retention include genetic buffering [12], functional redundancy [13,14,15], dosage balance constraints [5,16,17], or need for enhanced expression levels [18,19]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call