Data-driven guidelines for phylogenomic analyses using SNP data.

Jacob S Suissa,Gisel Y De La Cerda,Leland C Graber,Chloe Jelley,David Wickell,Heather R Phillips,Ayress D Grinage,Corrie S Moreau,Chelsea D Specht,Jeff J Doyle,Jacob B Landis

doi:10.1002/aps3.11611

Abstract

There is a general lack of consensus on the best practices for filtering of single-nucleotide polymorphisms (SNPs) and whether it is better to use SNPs or include flanking regions (full "locus") in phylogenomic analyses and subsequent comparative methods. Using genotyping-by-sequencing data from 22 Glycine species, we assessed the effects of SNP vs. locus usage and SNP retention stringency. We compared branch length, node support, and divergence time estimation across 16 datasets with varying amounts of missing data and total size. Our results revealed five aspects of phylogenomic data usage that may be generally applicable: (1) tree topology is largely congruent across analyses; (2) filtering strictly for SNP retention (e.g., 90-100%) reduces support and can alter some inferred relationships; (3) absolute branch lengths vary by two orders of magnitude between SNP and locus datasets; (4) data type and branch length variation have little effect on divergence time estimation; and (5) phylograms alter the estimation of ancestral states and rates of morphological evolution. Using SNP or locus datasets does not alter phylogenetic inference significantly, unless researchers want or need to use absolute branch lengths. We recommend against using excessive filtering thresholds for SNP retention to reduce the risk of producing inconsistent topologies and generating low support.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data-driven guidelines for phylogenomic analyses using SNP data.

Abstract

Talk to us

Similar Papers

More From: Applications in plant sciences

Lead the way for us

Journal: Applications in plant sciences	Publication Date: Aug 9, 2024
License type: CC BY 4.0

Similar Papers

CORRELATED RATES OF MOLECULAR AND MORPHOLOGICAL EVOLUTION.
Kevin E Omland
Evolution | VOL. 51
Kevin E OmlandKevin E Omland
01 Oct 1997
Evolution | VOL. 51

A comparison of SNP and STR loci for delineating population structure and performing individual genetic assignment
Kevin A Glover ... Sigbjørn Lien
BMC Genetics | VOL. 11
Kevin A Glover, et. al.Kevin A Glover ... Sigbjørn Lien
06 Jan 2010
BMC Genetics | VOL. 11

The impact of single nucleotide polymorphism selection on prediction of genomewide breeding values
Kacper Żukowski ... Joanna Szyda
BMC Proceedings | VOL. 3
Kacper Żukowski, et. al.Kacper Żukowski ... Joanna Szyda
23 Feb 2009
BMC Proceedings | VOL. 3

Genome sequencing and conservation genomics in the Scandinavian wolverine population.
Robert Ekblom ... Hans Ellegren
Conservation Biology | VOL. 32
Robert Ekblom, et. al.Robert Ekblom ... Hans Ellegren
07 Sep 2018
Conservation Biology | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data-driven guidelines for phylogenomic analyses using SNP data.

Abstract

Talk to us

Similar Papers

More From: Applications in plant sciences