Let Them Fall Where They May: Congruence Analysis in Massive Phylogenetically Messy Data Sets

J W Leigh, E Bapteste, P Lopez, K Schliep

doi:10.1093/molbev/msr110

Abstract

Interest in congruence in phylogenetic data has largely focused on issues affecting multicellular organisms, and animals in particular, in which the level of incongruence is expected to be relatively low. In addition, assessment methods developed in the past have been designed for reasonably small numbers of loci and scale poorly for larger data sets. However, there are currently over a thousand complete genome sequences available and of interest to evolutionary biologists, and these sequences are predominantly from microbial organisms, whose molecular evolution is much less frequently tree-like than that of multicellular life forms. As such, the level of incongruence in these data is expected to be high. We present a congruence method that accommodates both very large numbers of genes and high degrees of incongruence. Our method uses clustering algorithms to identify subsets of genes based on similarity of phylogenetic signal. It involves only a single phylogenetic analysis per gene, and therefore, computation time scales nearly linearly with the number of genes in the data set. We show that our method performs very well with sets of sequence alignments simulated under a wide variety of conditions. In addition, we present an analysis of core genes of prokaryotes, often assumed to have been largely vertically inherited, in which we identify two highly incongruent classes of genes. This result is consistent with the complexity hypothesis.
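The clustering idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes each gene has already been reduced to one tree, stored as a set of splits (bipartitions, each written as the side containing taxon t1), and it swaps in a Robinson-Foulds-style distance with average-linkage agglomerative clustering as stand-ins for the distance and clustering algorithm the paper actually uses. The gene names, taxa, and the helpers rf_distance, cluster_genes, and split are hypothetical. Note that this toy compares all pairs of genes; the near-linear scaling described in the abstract refers to running a single phylogenetic analysis per gene.

```python
from itertools import combinations

def rf_distance(splits_a, splits_b):
    """Robinson-Foulds-style distance: number of splits present in only one tree."""
    return len(splits_a ^ splits_b)

def cluster_genes(gene_splits, k):
    """Average-linkage agglomerative clustering of genes into k groups.

    gene_splits maps a gene name to the set of splits in that gene's tree.
    This is an illustrative stand-in, not the method from the paper.
    """
    clusters = [[name] for name in sorted(gene_splits)]

    def linkage(c1, c2):
        # Mean pairwise distance between the genes of two clusters.
        pairs = [(a, b) for a in c1 for b in c2]
        total = sum(rf_distance(gene_splits[a], gene_splits[b]) for a, b in pairs)
        return total / len(pairs)

    while len(clusters) > k:
        # Merge the two closest clusters (i < j, so deleting j keeps i valid).
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]

def split(*taxa):
    """Hypothetical helper: one side of a bipartition as a hashable set."""
    return frozenset(taxa)

# Toy data: four genes over taxa t1..t5, two conflicting phylogenetic signals.
gene_splits = {
    "geneA": {split("t1", "t2"), split("t1", "t2", "t3")},
    "geneB": {split("t1", "t2"), split("t1", "t2", "t3")},
    "geneC": {split("t1", "t3"), split("t1", "t3", "t4")},
    "geneD": {split("t1", "t3"), split("t1", "t3", "t5")},
}

if __name__ == "__main__":
    print(cluster_genes(gene_splits, 2))
```

In this toy input, geneA and geneB share identical splits, while geneC and geneD share one split with each other and none with the first pair, so asking for two clusters separates the two signals.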

Journal: Molecular Biology and Evolution
Publication Date: Apr 27, 2011
Citations: 46

Similar Papers

Genome data vs MLST for exploring intraspecific evolutionary history in bacteria: Much is not always better.
Noelia Floridia-Yapur ... Patricio Diosque
Infection, Genetics and Evolution | VOL. 93
01 Sep 2021

Data decisiveness, data quality, and incongruence in phylogenetic analysis: an example from the monocotyledons using mitochondrial atpA sequences.
Jerrold I Davis ... D Cannatella
Systematic Biology | VOL. 47
01 Jun 1998

Genetics, experience, and host-plant preference in Eurosta solidaginis: implications for host shifts and speciation.
Timothy P Craig ... Joanne K Itami
Evolution; international journal of organic evolution | VOL. 55
01 Jan 2001

GENETICS, EXPERIENCE, AND HOST-PLANT PREFERENCE IN EUROSTA SOLIDAGINIS: IMPLICATIONS FOR HOST SHIFTS AND SPECIATION
Timothy P Craig ... John D Horner
Evolution | VOL. 55
09 May 2007
