Relative model selection of evolutionary substitution models can be sensitive to multiple sequence alignment uncertainty

Stephanie J Spielman,Molly L Miraglia

doi:10.1186/s12862-021-01931-5

Stephanie J Spielman, Molly L Miraglia

Open Access

https://doi.org/10.1186/s12862-021-01931-5

Copy DOI

Abstract

BackgroundMultiple sequence alignments (MSAs) represent the fundamental unit of data inputted to most comparative sequence analyses. In phylogenetic analyses in particular, errors in MSA construction have the potential to induce further errors in downstream analyses such as phylogenetic reconstruction itself, ancestral state reconstruction, and divergence time estimation. In addition to providing phylogenetic methods with an MSA to analyze, researchers must also specify a suitable evolutionary model for the given analysis. Most commonly, researchers apply relative model selection to select a model from candidate set and then provide both the MSA and the selected model as input to subsequent analyses. While the influence of MSA errors has been explored for most stages of phylogenetics pipelines, the potential effects of MSA uncertainty on the relative model selection procedure itself have not been explored.ResultsWe assessed the consistency of relative model selection when presented with multiple perturbed versions of a given MSA. We find that while relative model selection is mostly robust to MSA uncertainty, in a substantial proportion of circumstances, relative model selection identifies distinct best-fitting models from different MSAs created from the same set of sequences. We find that this issue is more pervasive for nucleotide data compared to amino-acid data. However, we also find that it is challenging to predict whether relative model selection will be robust or sensitive to uncertainty in a given MSA.ConclusionsWe find that that MSA uncertainty can affect virtually all steps of phylogenetic analysis pipelines to a greater extent than has previously been recognized, including relative model selection.

Highlights

Multiple sequence alignments (MSAs) represent the fundamental unit of data inputted to most comparative sequence analyses
One of the most common approaches used to identify a suitable model for phylogenetic inference is relative model selection, wherein a set of candidate models are ranked according to a given goodness-of-fit measurement, and the best-fitting model is used in the phylogenetic reconstruction [34]
We broadly found that there is potential for model selection, in particular on nucleotide data, to identify different best-fitting evolutionary models for different MSA versions created from the same ortholog set

Summary

Introduction

Multiple sequence alignments (MSAs) represent the fundamental unit of data inputted to most comparative sequence analyses. In addition to providing phylogenetic methods with an MSA to analyze, researchers must specify a suitable evolutionary model for the given analysis. While the influence of MSA errors has been explored for most stages of phylogenetics pipelines, the potential effects of MSA uncertainty on the relative model selection procedure itself have not been explored. While the effects of MSA uncertainty in phylogenetic pipelines have been heavily studied, the MSA is not the Spielman and Miraglia BMC Ecology and Evolution (2021) 21:214 only piece of information that is inputted to phylogenetic reconstruction and other evolutionary-informed analyses. Recent studies have suggested that relative model selection may not be a critical step in phylogenetic studies [2, 31, 33], it remains an enduring staple of most analysis pipelines. We use the phrase “model selection” to refer to relative model selection, unless otherwise stated

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Ecology and Evolution	Publication Date: Nov 29, 2021
Citations: 2	License type: open-access

R Discovery Prime

R Discovery Prime

Relative model selection of evolutionary substitution models can be sensitive to multiple sequence alignment uncertainty

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Ecology and Evolution

Lead the way for us

Similar Papers

Re-Evaluating Botryosphaeriales: Ancestral State Reconstructions of Selected Characters and Evolution of Nutritional Modes
Achala R Rathnayaka ... Jian-Kui Liu
Journal of Fungi | VOL. 9
Achala R Rathnayaka, et. al.Achala R Rathnayaka ... Jian-Kui Liu
29 Jan 2023
Journal of Fungi | VOL. 9

Author response: Comprehensive phylogenetic analysis of the ribonucleotide reductase family reveals an ancestral clade
Audrey A Burnim ... Colin J Jackson
-
Audrey A Burnim, et. al.Audrey A Burnim ... Colin J Jackson
11 Aug 2022
11 Aug 2022

Effects of Phylogenetic Signal on Ancestral State Reconstruction
Glenn Litsios ... Nicolas Salamin
Systematic Biology | VOL. 61
Glenn Litsios, et. al.Glenn Litsios ... Nicolas Salamin
04 Jan 2012
Systematic Biology | VOL. 61

Implications of gene tree heterogeneity on downstream phylogenetic analyses: A case study employing the Fair Proportion index.
Kristina Wicke ... Laura Kubatko
PloS one | VOL. 19
Kristina Wicke, et. al.Kristina Wicke ... Laura Kubatko
25 Apr 2024
PloS one | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Relative model selection of evolutionary substitution models can be sensitive to multiple sequence alignment uncertainty

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Ecology and Evolution