The Influence of the Number of Tree Searches on Maximum Likelihood Inference in Phylogenomics.

Chao Liu, Chris Todd Hittinger, Ronghui Pan, Jinyan Huang, Antonis Rokas, Xue-Xin Chen, Xing-Xing Shen, Yun Chen, Xiaofan Zhou, Yuanning Li

doi:10.1093/sysbio/syae031

Abstract

Maximum likelihood (ML) phylogenetic inference is widely used in phylogenomics. As heuristic searches most likely find suboptimal trees, it is recommended to conduct multiple (e.g., 10) tree searches in phylogenetic analyses. However, beyond its positive role, how and to what extent multiple tree searches aid ML phylogenetic inference remains poorly explored. Here, we found that a random starting tree was not as effective as the BioNJ and parsimony starting trees in inferring the ML gene tree and that RAxML-NG and PhyML were less sensitive to different starting trees than IQ-TREE. We then examined the effect of the number of tree searches on ML tree inference with IQ-TREE and RAxML-NG, by running 100 tree searches on 19,414 gene alignments from 15 animal, plant, and fungal phylogenomic datasets. We found that the number of tree searches substantially impacted the recovery of the best-of-100 ML gene tree topology among 100 searches for a given ML program. In addition, all of the concatenation-based trees were topologically identical if the number of tree searches was ≥10. Quartet-based ASTRAL trees inferred from 1 to 80 tree searches differed topologically from those inferred from 100 tree searches for 6/15 phylogenomic datasets. Finally, our simulations showed that gene alignments with lower difficulty scores had a higher chance of finding the best-of-100 gene tree topology and were more likely to yield the correct trees.
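To give a sense of why the number of tree searches matters for recovering the best-of-100 topology, the following minimal sketch computes the chance that a random subset of k searches contains at least one search that reached the best topology. It is an illustration under stated assumptions, not the authors' procedure: it assumes the k searches are drawn without replacement from the full set of searches, and the function name `prob_recover_best` and its parameters are hypothetical.

```python
from math import comb


def prob_recover_best(n_total: int, n_best: int, k: int) -> float:
    """Chance that k of n_total searches include the best topology.

    n_total: number of tree searches that were run (e.g., 100).
    n_best:  how many of those searches reached the best-of-n_total topology.
    k:       number of searches one would have run instead.

    Assumption (illustrative only): the k searches are a uniformly random
    subset of the n_total searches, drawn without replacement.
    """
    if not (0 <= n_best <= n_total):
        raise ValueError("n_best must lie between 0 and n_total")
    if not (0 <= k <= n_total):
        raise ValueError("k must lie between 0 and n_total")
    # Probability that all k draws come from the searches that missed the
    # best topology, subtracted from 1 (hypergeometric complement).
    return 1.0 - comb(n_total - n_best, k) / comb(n_total, k)


if __name__ == "__main__":
    # Hypothetical input: 5 of 100 searches reached the best topology.
    for k in (1, 10, 50):
        print(k, round(prob_recover_best(100, 5, k), 3))
```

Under this assumption, a gene alignment for which only a few of the 100 searches reach the best topology needs many more searches to recover it reliably than one for which most searches agree.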

Journal: Systematic Biology
