Excluding Loci With Substitution Saturation Improves Inferences From Phylogenomic Data.

David A Duchêne, Cara Van Der Wal, Simon Y W Ho, Niklas Mather

doi:10.1093/sysbio/syab075

Open Access

https://doi.org/10.1093/sysbio/syab075

Journal: Systematic Biology
Publication Date: Sep 11, 2021
Citations: 26
License type: CC BY 4.0

Affiliation: University of Copenhagen, University of Sydney

Abstract

The historical signal in nucleotide sequences becomes eroded over time by substitutions occurring repeatedly at the same sites. This phenomenon, known as substitution saturation, is recognized as one of the primary obstacles to deep-time phylogenetic inference using genome-scale data sets. We present a new test of substitution saturation and demonstrate its performance in simulated and empirical data. For some of the 36 empirical phylogenomic data sets that we examined, we detect substitution saturation in around 50% of loci. We found that saturation tends to be flagged as problematic in loci with highly discordant phylogenetic signals across sites. Within each data set, the loci with smaller numbers of informative sites are more likely to be flagged as containing problematic levels of saturation. The entropy saturation test proposed here is sensitive to high evolutionary rates relative to the evolutionary timeframe, while also being sensitive to several factors known to mislead phylogenetic inference, including short internal branches relative to external branches, short nucleotide sequences, and tree imbalance. Our study demonstrates that excluding loci with substitution saturation can be an effective means of mitigating the negative impact of multiple substitutions on phylogenetic inferences. [Phylogenetic model performance; phylogenomics; substitution model; substitution saturation; test statistics.]
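The abstract does not specify how the entropy saturation test is computed, so the following is only a minimal sketch of the general idea that repeated substitutions push alignment columns toward random-looking base composition. It computes the mean per-site Shannon entropy of a nucleotide alignment, which ranges from 0 bits (fully conserved columns) to 2 bits (all four bases equally frequent). The function names `site_entropy` and `mean_site_entropy` and the toy alignments are hypothetical, and this is not the authors' test statistic or its decision threshold.

```python
import math
from collections import Counter


def site_entropy(column):
    """Shannon entropy (bits) of one alignment column.

    Gaps and ambiguity codes are ignored (an assumption of this sketch).
    """
    counts = Counter(base for base in column.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def mean_site_entropy(alignment):
    """Mean per-site entropy over all columns of an equal-length alignment."""
    lengths = {len(seq) for seq in alignment}
    if len(lengths) != 1 or 0 in lengths:
        raise ValueError("need non-empty sequences of identical length")
    entropies = [site_entropy("".join(col)) for col in zip(*alignment)]
    return sum(entropies) / len(entropies)


# Toy alignments (illustrative only, not data from the paper).
conserved = ["ACGTACGT", "ACGTACGT", "ACGTACGA", "ACGTACGT"]
scrambled = ["ACGTACGT", "CGTACGTA", "GTACGTAC", "TACGTACG"]

print(mean_site_entropy(conserved))  # low: nearly every column is invariant
print(mean_site_entropy(scrambled))  # at the 2-bit maximum for four bases
```

A locus whose mean entropy sits near the 2-bit ceiling carries little recoverable historical signal in this simplified picture. A real test would compare the observed value against an expectation derived from a substitution model rather than against a fixed cutoff.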
