Abstract

BackgroundGene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation. In turn, gene expression evolution is a critical component of overall functional evolution of paralogs. Inferring evolutionary history of gene expression among paralogs is therefore a problem of considerable interest. It also represents significant challenges. The standard approaches of evolutionary reconstruction assume that at an internal node of the duplication tree, the two duplicates evolve independently. However, because of various selection pressures functional evolution of the two paralogs may be coupled. The coupling of paralog evolution corresponds to three major fates of gene duplicates: subfunctionalization (SF), conserved function (CF) or neofunctionalization (NF). Quantitative analysis of these fates is of great interest and clearly influences evolutionary inference of expression. These two interrelated problems of inferring gene expression and evolutionary fates of gene duplicates have not been studied together previously and motivate the present study.ResultsHere we propose a novel probabilistic framework and algorithm to simultaneously infer (i) ancestral gene expression and (ii) the likely fate (SF, NF, CF) at each duplication event during the evolution of gene family. Using tissue-specific gene expression data, we develop a nonparametric belief propagation (NBP) algorithm to predict the ancestral expression level as a proxy for function, and describe a novel probabilistic model that relates the predicted and known expression levels to the possible evolutionary fates. We validate our model using simulation and then apply it to a genome-wide set of gene duplicates in human.ConclusionsOur results suggest that SF tends to be more frequent at the earlier stage of gene family expansion, while NF occurs more frequently later on.

Highlights

  • Gene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation

  • We propose a new description of SF, conserved function (CF) and NF using tissue-specific gene expression level as a surrogate for gene function

  • We propose a novel probabilistic model to quantify the three evolutionary fates including SF, CF and NF based on the predicted ancestral function from the phylogenetic tree using the nonparametric belief propagation algorithm

Read more

Summary

Introduction

Gene duplication, followed by functional evolution of duplicate genes, is a primary engine of evolutionary innovation. The coupling of paralog evolution corresponds to three major fates of gene duplicates: subfunctionalization (SF), conserved function (CF) or neofunctionalization (NF) Quantitative analysis of these fates is of great interest and clearly influences evolutionary inference of expression. Several proxies have been used to represent gene function including protein-protein interactions (PPI) [5,6,7], regulatory networks [8], fitness effect [9], metabolic networks [10,11], genetic interactions [12], and gene expression patterns [13,14,15,16] Each of these alternatives necessitates a different model to quantify the three fates in the functional evolution of paralogs. In [17], the ancestral expression is set to equal-weighted average of descendent expression values Such approaches assume that the duplicate genes diversify independently, which is not the case as the postulated evolutionary fates of duplicates–SF, NF, CF–implicitly assume coupled evolution.

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.