The Asymptotic Behavior of Bootstrap Support Values in Molecular Phylogenetics.

Jun Huang,Yuting Liu,Tianqi Zhu,Ziheng Yang

doi:10.1093/sysbio/syaa100

Abstract

The phylogenetic bootstrap is the most commonly used method for assessing statistical confidence in estimated phylogenies by non-Bayesian methods such as maximum parsimony and maximum likelihood (ML). It is observed that bootstrap support tends to be high in large genomic data sets whether or not the inferred trees and clades are correct. Here, we study the asymptotic behavior of bootstrap support for the ML tree in large data sets when the competing phylogenetic trees are equally right or equally wrong. We consider phylogenetic reconstruction as a problem of statistical model selection when the compared models are nonnested and misspecified. The bootstrap is found to have qualitatively different dynamics from Bayesian inference and does not exhibit the polarized behavior of posterior model probabilities, consistent with the empirical observation that the bootstrap is more conservative than Bayesian probabilities. Nevertheless, bootstrap support similarly shows fluctuations among large data sets, with no convergence to a point value, when the compared models are equally right or equally wrong. Thus, in large data sets strong support for wrong trees or models is likely to occur. Our analysis provides a partial explanation for the high bootstrap support values for incorrect clades observed in empirical data analysis. [Bootstrap; model selection; star-tree paradox; support value.].

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The Asymptotic Behavior of Bootstrap Support Values in Molecular Phylogenetics.

Abstract

Talk to us

Similar Papers

More From: Systematic biology

Lead the way for us

Journal: Systematic biology	Publication Date: Dec 30, 2020
Citations: 7

Similar Papers

Does Choice in Model Selection Affect Maximum Likelihood Analysis?
Jennifer Ripplinger ... Jack Sullivan
Systematic Biology | VOL. 57
Jennifer Ripplinger, et. al.Jennifer Ripplinger ... Jack Sullivan
01 Feb 2008
Systematic Biology | VOL. 57

Estimating amino acid substitution models from genome datasets: a simulation study on the performance of estimated models.
Nguyen Huy Tinh ... Le Sy Vinh
Journal of Evolutionary Biology | VOL. 37
Nguyen Huy Tinh, et. al.Nguyen Huy Tinh ... Le Sy Vinh
12 Dec 2023
Journal of Evolutionary Biology | VOL. 37

Turning Vice into Virtue: Using Batch-Effects to Detect Errors in Large Genomic Data Sets.
Fabrizio Mafessoni ... Leif Groop
Genome Biology and Evolution | VOL. 10
Fabrizio Mafessoni, et. al.Fabrizio Mafessoni ... Leif Groop
10 Sep 2018
Genome Biology and Evolution | VOL. 10

Evaluation on genetic relationships among China’s endemic Curcuma L. herbs by mtDNA
Deng Jb ... Ft Zhang
Phyton | VOL. 87
Deng Jb, et. al.Deng Jb ... Ft Zhang
01 Jan 2018
Phyton | VOL. 87

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Asymptotic Behavior of Bootstrap Support Values in Molecular Phylogenetics.

Abstract

Talk to us

Similar Papers

More From: Systematic biology