Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem

Pawel Górecki, Oliver Eulenstein

doi:10.1186/1471-2105-13-s10-s14

Open Access

Journal: BMC Bioinformatics
Publication Date: Jun 1, 2012
Citations: 38
License type: CC BY 2.0

Affiliation: University of Warsaw, Iowa State University

Abstract

Background: Evolutionary methods are increasingly challenged by the wealth of fast-growing resources of genomic sequence information. Evolutionary events, like gene duplication, loss, and deep coalescence, account more than ever for incongruence between gene trees and the actual species tree. Gene tree reconciliation addresses this fundamental problem by invoking the minimum number of gene duplications and losses that reconcile a rooted gene tree with a rooted species tree. However, the reconciliation process is highly sensitive to topological error or wrong rooting of the gene tree, and most gene trees in practice are not free of such errors. Thus, despite the promises of gene tree reconciliation, its applicability in practice is severely limited.

Results: We introduce the problem of reconciling unrooted and erroneous gene trees by simultaneously rooting and error-correcting them, and describe an efficient algorithm for this problem. Moreover, we introduce an error-corrected version of the gene duplication problem, a standard application of gene tree reconciliation. Because the original version of the gene duplication problem is NP-hard, we introduce an effective heuristic for our error-corrected version. Our experimental results suggest that our error-correcting approaches for unrooted input trees can significantly improve the accuracy of gene tree reconciliation and of species tree inference under the gene duplication problem. Furthermore, our algorithm for error-correcting reconciliation is efficient enough to handle truly large-scale phylogenetic studies.

Conclusions: Our error-correction approach is a crucial step towards making gene tree reconciliation more robust, and thus towards improving the accuracy of applications that fundamentally rely on gene tree reconciliation, like the inference of gene-duplication supertrees.

Highlights

Introduction to unrooted reconciliation: here we highlight some results from [18] that are used in the design of our algorithm.
Most inference methods used in practice return only unrooted gene trees, which have to be rooted for the gene tree reconciliation process.
We first describe our algorithm for computing the optimal cost and the set of optimal edges after one nearest neighbor interchange (NNI) operation performed on an unrooted gene tree, and then extend it to the general case of k NNI operations.
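
To make the NNI operation mentioned above concrete, here is a minimal sketch that enumerates all trees one NNI away from an unrooted binary tree. It is only an illustration of the neighborhood being searched, not the authors' algorithm: the adjacency-map representation, the function name `nni_neighbors`, and the node labels are assumptions introduced for this example.

```python
def nni_neighbors(adj):
    """Yield every tree that is one NNI away from an unrooted binary tree.

    `adj` maps each node label (a string, hypothetical representation) to the
    set of its neighbours; leaves have degree 1, internal nodes degree 3.
    """
    seen = set()
    for u in adj:
        if len(adj[u]) != 3:
            continue
        for v in adj[u]:
            # Only internal edges (both endpoints of degree 3) admit an NNI.
            if len(adj[v]) != 3 or (v, u) in seen:
                continue
            seen.add((u, v))
            _, b = sorted(adj[u] - {v})   # subtrees hanging off u
            c, d = sorted(adj[v] - {u})   # subtrees hanging off v
            # Swapping b with c, or b with d, gives the two NNI neighbours.
            for x in (c, d):
                new = {n: set(nb) for n, nb in adj.items()}
                new[u].remove(b); new[u].add(x)
                new[v].remove(x); new[v].add(b)
                new[b].remove(u); new[b].add(v)
                new[x].remove(v); new[x].add(u)
                yield new


# Quartet tree ab|cd with a single internal edge u-v.
quartet = {
    "a": {"u"}, "b": {"u"}, "c": {"v"}, "d": {"v"},
    "u": {"a", "b", "v"}, "v": {"c", "d", "u"},
}
neighbours = list(nni_neighbors(quartet))
```

An unrooted binary tree with n leaves has n - 3 internal edges, so this enumeration produces 2(n - 3) trees; the quartet above yields the two alternative topologies ac|bd and ad|bc.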

Summary

Introduction

Introduction to unrooted reconciliation: here we highlight some results from [18] that are used in the design of our algorithm. Gene tree reconciliation addresses the incongruence between gene trees and the species tree by invoking the minimum number of gene duplications and losses that reconcile a rooted gene tree with a rooted species tree. Topological error, that is, an incorrect gene tree topology, can be caused by the inference process (e.g. noise in the underlying sequence data) or by the inference method itself (e.g. heuristic results). This problem has been addressed for rooted gene trees by 'correcting the error'; that is, editing the given tree such that the number of invoked gene duplications and losses is minimized [16,17]. Rooting problems can be bypassed by identifying roots that minimize the invoked number of gene duplications and losses [7,16,17,18,19].
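
The idea of choosing a root that minimizes the reconciliation cost can be sketched as follows. This is a brute-force illustration under stated assumptions, not the efficient algorithm described in the paper: trees are nested 2-tuples with species names as leaf labels, only duplications are counted (losses are omitted for brevity), and all function names are hypothetical.

```python
def leaves(t):
    """Set of leaf labels of a rooted binary tree given as nested 2-tuples."""
    return {t} if isinstance(t, str) else leaves(t[0]) | leaves(t[1])


def species_clusters(s):
    """Clusters (leaf sets) of all nodes of the species tree."""
    here = [frozenset(leaves(s))]
    if isinstance(s, str):
        return here
    return here + species_clusters(s[0]) + species_clusters(s[1])


def lca(clusters, species):
    """Smallest species-tree cluster containing `species` (the LCA node)."""
    return min((c for c in clusters if species <= c), key=len)


def duplications(g, clusters):
    """Return (LCA mapping of the root of g, number of duplication nodes)."""
    if isinstance(g, str):
        return lca(clusters, frozenset({g})), 0
    ml, dl = duplications(g[0], clusters)
    mr, dr = duplications(g[1], clusters)
    m = lca(clusters, ml | mr)
    # A gene node is a duplication if it maps to the same node as a child.
    return m, dl + dr + int(m == ml or m == mr)


def rootings(t):
    """Yield one rooted tree per edge of the unrooted tree underlying t."""
    def descend(node, rest):
        yield (node, rest)
        if not isinstance(node, str):
            yield from descend(node[0], (node[1], rest))
            yield from descend(node[1], (node[0], rest))

    yield t
    for side, other in ((t[0], t[1]), (t[1], t[0])):
        if not isinstance(side, str):
            yield from descend(side[0], (side[1], other))
            yield from descend(side[1], (side[0], other))


def best_rooting(gene_tree, species_tree):
    """Return (duplication cost, rooted tree) minimizing the cost."""
    cl = species_clusters(species_tree)
    return min(((duplications(r, cl)[1], r) for r in rootings(gene_tree)),
               key=lambda pair: pair[0])


S = (("a", "b"), ("c", "d"))
G = ("a", ("b", ("c", "d")))      # arbitrary rooting of the gene tree ab|cd
cost_as_given = duplications(G, species_clusters(S))[1]
best_cost, best_tree = best_rooting(G, S)
```

In this example the gene tree as given requires one duplication, while rooting it on the edge separating {a, b} from {c, d} requires none. The sketch recomputes the cost from scratch for every candidate root, which is much slower than necessary; it is meant only to show what is being minimized.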
