Missing Data and Influential Sites: Choice of Sites for Phylogenetic Analysis Can Be As Important As Taxon Sampling and Model Choice

Liat Shavit Grievink,David Penny,Barbara R Holland

doi:10.1093/gbe/evt032

Liat Shavit Grievink, David Penny + Show 1 more

Open Access

https://doi.org/10.1093/gbe/evt032

Copy DOI

Abstract

Phylogenetic studies based on molecular sequence alignments are expected to become more accurate as the number of sites in the alignments increases. With the advent of genomic-scale data, where alignments have very large numbers of sites, bootstrap values close to 100% and posterior probabilities close to 1 are the norm, suggesting that the number of sites is now seldom a limiting factor on phylogenetic accuracy. This provokes the question, should we be fussy about the sites we choose to include in a genomic-scale phylogenetic analysis? If some sites contain missing data, ambiguous character states, or gaps, then why not just throw them away before conducting the phylogenetic analysis? Indeed, this is exactly the approach taken in many phylogenetic studies. Here, we present an example where the decision on how to treat sites with missing data is of equal importance to decisions on taxon sampling and model choice, and we introduce a graphical method for illustrating this.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Genome Biology and Evolution	Publication Date: Mar 6, 2013
Citations: 32	License type: cc-by-nc

R Discovery Prime

R Discovery Prime

Missing Data and Influential Sites: Choice of Sites for Phylogenetic Analysis Can Be As Important As Taxon Sampling and Model Choice

Abstract

Talk to us

Similar Papers

More From: Genome Biology and Evolution

Lead the way for us

Similar Papers

Incomplete taxa, incomplete characters, and phylogenetic accuracy: is there a missing data problem?
John J Wiens
Journal of Vertebrate Paleontology | VOL. 23
John J WiensJohn J Wiens
17 Jun 2003
Journal of Vertebrate Paleontology | VOL. 23

Does adding characters with missing data increase or decrease phylogenetic accuracy?
John J Wiens
Systematic Biology | VOL. 47
John J WiensJohn J Wiens
30 Dec 1998
Systematic Biology | VOL. 47

The Use and Validity of Composite Taxa in Phylogenetic Analysis
Véronique Campbell ... François-Joseph Lapointe
Systematic Biology | VOL. 58
Véronique Campbell, et. al.Véronique Campbell ... François-Joseph Lapointe
21 Sep 2009
Systematic Biology | VOL. 58

Should genes with missing data be excluded from phylogenetic analyses?
Wei Jiang ... John J Wiens
Molecular Phylogenetics and Evolution | VOL. 80
Wei Jiang, et. al.Wei Jiang ... John J Wiens
11 Aug 2014
Molecular Phylogenetics and Evolution | VOL. 80

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Missing Data and Influential Sites: Choice of Sites for Phylogenetic Analysis Can Be As Important As Taxon Sampling and Model Choice

Abstract

Talk to us

Similar Papers

More From: Genome Biology and Evolution