Enumerating all maximal frequent subtrees in collections of phylogenetic trees.

David Fernández-Baca, Akshay Deepak

doi:10.1186/1748-7188-9-16

Open Access

Journal: Algorithms for molecular biology : AMB
Publication Date: Jun 18, 2014
Citations: 1
License type: CC BY 2.0

Affiliation: Iowa State University

Abstract

Background

A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method for doing so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, for computing congruence indices, and for identifying horizontal gene transfer events.

Results

We give algorithms and experimental results for two approaches to identifying common patterns in a collection of phylogenetic trees: one based on agreement subtrees, called maximal agreement subtrees, and the other based on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal common phylogenetic relationships that are present in neither MASTs nor the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/.

Conclusions

Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.
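To make the notion of trees "agreeing" on a subset of taxa concrete, the following is a minimal sketch and not the authors' MFSTMINER implementation. It assumes that rooted trees are written as nested Python tuples with string leaf labels, that every tree contains all the taxa being tested, and that a subtree counts as frequent when the fraction of input trees whose restriction to its taxa has the same topology meets a chosen support threshold. The function names `restrict`, `clusters`, and `support` are hypothetical.

```python
from collections import Counter

def restrict(tree, keep):
    """Restrict a rooted tree to the taxa in `keep`.

    Leaves outside `keep` are dropped and any internal node left with a
    single child is suppressed. Returns None if no leaf survives.
    """
    if isinstance(tree, str):
        return tree if tree in keep else None
    kids = [r for r in (restrict(child, keep) for child in tree) if r is not None]
    if not kids:
        return None
    if len(kids) == 1:
        return kids[0]
    return tuple(kids)

def clusters(tree):
    """Return the set of clusters (leaf sets below each internal node).

    Two rooted trees on the same leaf set have the same topology exactly
    when their cluster sets are equal, regardless of child order.
    """
    found = set()

    def walk(node):
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(walk(child) for child in node))
        found.add(below)
        return below

    if tree is not None:
        walk(tree)
    return frozenset(found)

def support(trees, taxa):
    """Fraction of trees sharing the most common restriction to `taxa`,
    together with that restriction's cluster set."""
    keep = set(taxa)
    counts = Counter(clusters(restrict(t, keep)) for t in trees)
    best, n = counts.most_common(1)[0]
    return n / len(trees), best

# Hypothetical toy collection of three trees on taxa a..e.
trees = [
    ((("a", "b"), "c"), ("d", "e")),
    ((("a", "b"), "d"), ("c", "e")),
    (("a", ("b", "c")), ("d", "e")),
]

frac_abd, _ = support(trees, {"a", "b", "d"})  # all three trees agree on these taxa
frac_abc, _ = support(trees, {"a", "b", "c"})  # only two of the three trees agree
```

In this toy collection the taxa a, b, d induce the same restricted topology in every tree, whereas a, b, c do so in only two of three trees, so the latter set would qualify only under a support threshold of at most 2/3. Enumerating maximal such sets efficiently is the hard part that the paper addresses; this sketch only checks one candidate set.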

Highlights

A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees
A phylogenetic tree is an unordered rooted tree whose leaves are in one-to-one correspondence with a set of species; its topology represents the hypothetical evolutionary relationships among these species
We evaluated the scalability of MFSTMINER with respect to the number of leaves (10-250), the number of trees (100-10000), and the support value (0.51-1.0) on the datasets having at least 250 leaves, i.e., datasets D (354 taxa) through Q (2554 taxa)
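Because a phylogenetic tree is unordered, two written forms that differ only in the order of children describe the same tree. A minimal sketch of how this can be checked, assuming trees are given as nested Python tuples of string leaf labels (the names `canonical` and `leaf_labels` are hypothetical and not taken from the paper):

```python
def canonical(tree):
    """Canonical form of an unordered rooted tree.

    Children are canonicalized recursively and then sorted by their
    repr, so any two orderings of the same tree map to the same value.
    """
    if isinstance(tree, str):
        return tree
    return tuple(sorted((canonical(child) for child in tree), key=repr))

def leaf_labels(tree):
    """List the leaf labels of a tree in left-to-right order."""
    if isinstance(tree, str):
        return [tree]
    labels = []
    for child in tree:
        labels.extend(leaf_labels(child))
    return labels

# Same tree written with two different child orders.
t1 = (("a", "b"), ("c", ("d", "e")))
t2 = ((("e", "d"), "c"), ("b", "a"))
# A genuinely different topology on the same species.
t3 = (("a", "c"), ("b", ("d", "e")))

same_tree = canonical(t1) == canonical(t2)
different_tree = canonical(t1) != canonical(t3)

# Leaves must be in one-to-one correspondence with species: no repeats.
labels = leaf_labels(t1)
one_to_one = len(labels) == len(set(labels))
```

Here `t1` and `t2` compare equal after canonicalization, while `t3` does not, and the final check confirms that each species labels exactly one leaf.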

Summary

Results

We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. Our current implementation is available on the web at https://code.google.com/p/mfst-miner/

Conclusions

Background

X prunes Y if either of the following holds:

If Txy is a result of a type-2 join and Tyz is not a result of a type-2 join

5: Add Txy to ETx

Results and discussion
