A Fast, Reproducible, High-throughput Variant Calling Workflow for Population Genomics.

Cade D Mirchandani,Timothy B Sackton,Russ Corbett-Detig,Erik Enbody,Mara Baylis,Brian Arnold,Gregg W C Thomas,Sara J Smith,Allison J Shultz

doi:10.1093/molbev/msad270

Abstract

The increasing availability of genomic resequencing data sets and high-quality reference genomes across the tree of life present exciting opportunities for comparative population genomic studies. However, substantial challenges prevent the simple reuse of data across different studies and species, arising from variability in variant calling pipelines, data quality, and the need for computationally intensive reanalysis. Here, we present snpArcher, a flexible and highly efficient workflow designed for the analysis of genomic resequencing data in nonmodel organisms. snpArcher provides a standardized variant calling pipeline and includes modules for variant quality control, data visualization, variant filtering, and other downstream analyses. Implemented in Snakemake, snpArcher is user-friendly, reproducible, and designed to be compatible with high-performance computing clusters and cloud environments. To demonstrate the flexibility of this pipeline, we applied snpArcher to 26 public resequencing data sets from nonmammalian vertebrates. These variant data sets are hosted publicly to enable future comparative population genomic analyses. With its extensibility and the availability of public data sets, snpArcher will contribute to a broader understanding of genetic variation across species by facilitating the rapid use and reuse of large genomic data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Molecular biology and evolution	Publication Date: Dec 9, 2023
Citations: 10	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Fast, Reproducible, High-throughput Variant Calling Workflow for Population Genomics.

Abstract

Talk to us

Similar Papers

More From: Molecular biology and evolution

Lead the way for us

Similar Papers

Positional bias in variant calls against draft reference assemblies
Roman V Briskine ... Kentaro K Shimizu
BMC Genomics | VOL. 18
Roman V Briskine, et. al.Roman V Briskine ... Kentaro K Shimizu
28 Mar 2017
BMC Genomics | VOL. 18

Abstract A1-41: Automated pipeline for high confidence variant calling and functional annotation, for matched tumor/normal samples sequenced by next-generation sequencing (NGS)
Susan M Grimes ... Stephanie Greer
Cancer Research | VOL. 75
Susan M Grimes, et. al.Susan M Grimes ... Stephanie Greer
15 Nov 2015
Cancer Research | VOL. 75

Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers
Jiayun Chen ... Hongbin Zhong
Scientific Reports | VOL. 9
Jiayun Chen, et. al.Jiayun Chen ... Hongbin Zhong
27 Jun 2019
Scientific Reports | VOL. 9

HashSeq: a Simple, Scalable, and Conservative De Novo Variant Caller for 16S rRNA Gene Data Sets.
Farnaz Fouladi ... Jacqueline B Young
mSystems | VOL. 6
Farnaz Fouladi, et. al.Farnaz Fouladi ... Jacqueline B Young
09 Nov 2021
mSystems | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Fast, Reproducible, High-throughput Variant Calling Workflow for Population Genomics.

Abstract

Talk to us

Similar Papers

More From: Molecular biology and evolution