Fragmentary Gene Sequences Negatively Impact Gene Tree and Species Tree Reconstruction.

Erfan Sayyari, Siavash Mirarab, James B Whitfield

doi:10.1093/molbev/msx261

Abstract

Species tree reconstruction from genome-wide data is increasingly being attempted, in most cases using a two-step approach of first estimating individual gene trees and then summarizing them to obtain a species tree. The accuracy of this approach, which promises to account for gene tree discordance, depends on the quality of the inferred gene trees. At the same time, phylogenomic and phylotranscriptomic analyses typically use involved bioinformatics pipelines for data preparation. Errors and shortcomings resulting from these preprocessing steps may impact the species tree analyses at the other end of the pipeline. In this article, we first show that the presence of fragmentary data for some species in a gene alignment, as often seen in real data, can result in substantial deterioration of gene trees, and as a result, the species tree. We then investigate a simple filtering strategy where individual fragmentary sequences are removed from individual genes but the rest of the gene is retained. Both in simulations and by reanalyzing a large insect phylotranscriptomic data set, we show the effectiveness of this simple filtering strategy.
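
The filtering strategy described in the abstract (drop individual fragmentary sequences, keep the rest of the gene) can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the function names `occupancy` and `filter_fragments`, the set of gap characters, and the 0.5 threshold are all assumptions chosen for the example, since the abstract does not state how a fragment is defined.

```python
from typing import Dict

# Assumption: these characters mark missing data in an aligned sequence.
GAP_CHARS = set("-?")


def occupancy(seq: str) -> float:
    """Fraction of alignment columns where this sequence has a non-gap character."""
    if not seq:
        return 0.0
    present = sum(1 for ch in seq if ch not in GAP_CHARS)
    return present / len(seq)


def filter_fragments(alignment: Dict[str, str],
                     min_occupancy: float = 0.5) -> Dict[str, str]:
    """Remove fragmentary sequences from one gene alignment.

    Only the offending sequences are dropped; every other taxon in the
    gene is retained. The threshold is hypothetical, not from the paper.
    """
    return {taxon: seq for taxon, seq in alignment.items()
            if occupancy(seq) >= min_occupancy}


# Toy gene alignment: taxonC covers only 2 of 10 columns.
gene = {
    "taxonA": "ACGTACGTAC",
    "taxonB": "ACGTAC-TAC",
    "taxonC": "------GT--",
}
kept = filter_fragments(gene, min_occupancy=0.5)
print(sorted(kept))
```

In this sketch the filter is applied per gene, so a taxon removed from one alignment can still appear in other genes where its sequence is more complete.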

Journal: Molecular Biology and Evolution
Publication Date: Oct 4, 2017
Citations: 73

Similar Papers

Gene tree reconstruction and orthology analysis based on an integrated model for duplications and sequence evolution
Lars Arvestad ... Bengt Sennblad
01 Jan 2004

Genomic Characterization and Curation of UCEs Improves Species Tree Reconstruction
Matthew H Van Dam ... Michelle Trautwein
Systematic Biology | VOL. 70
04 Aug 2020

Measuring Branch Support in Species Trees Obtained by Gene Tree Parsimony
Simon Joly ... Anne Bruneau
Systematic Biology | VOL. 58
01 Feb 2009

The inference of gene trees with species trees.
Gergely J Szöllősi ... Bastien Boussau
Systematic Biology | VOL. 64
28 Jul 2014
