GCViT: a method for interactive, genome-wide visualization of resequencing and SNP array data

Andrew P Wilkey, Steven B Cannon, Ethalinda K S Cannon, Anne V Brown

doi:10.1186/s12864-020-07217-2

Abstract

Background: Large genotyping datasets have become commonplace due to efficient, cheap methods for SNP identification. Typical genotyping datasets may have thousands to millions of data points per accession, across tens to thousands of accessions. There is a need for tools to help rapidly explore such datasets, to assess characteristics such as overall differences between accessions and regional anomalies across the genome.

Results: We present GCViT (Genotype Comparison Visualization Tool), for visualizing and exploring large genotyping datasets. GCViT can be used to identify introgressions, conserved or divergent genomic regions, pedigrees, and other features for more detailed exploration. The program can be used online or as a local instance for whole-genome visualization of resequencing or SNP array data. The program compares variants among user-selected accessions to identify allele differences and similarities between accessions and a user-selected reference, providing visualizations through histogram, heatmap, or haplotype views. The resulting analyses and images can be exported in various formats.

Conclusions: GCViT provides methods for interactively visualizing SNP data on a whole-genome scale, and can produce publication-ready figures. It can be used in online or local installations. GCViT enables users to confirm or identify genomic regions of interest associated with particular traits.

GCViT is freely available at https://github.com/LegumeFederation/gcvit. The 1.0 version described here is available at https://doi.org/10.5281/zenodo.4008713.
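The comparison described in the Results can be illustrated with a minimal sketch: count, per fixed-width genomic bin, the sites where an accession's genotype differs from that of a chosen reference accession. Counts of this kind are what a histogram or heatmap view would plot. This is not GCViT's implementation; the function name `binned_differences`, the in-memory tuple format, the accession names, and the bin size are all assumptions made for illustration.

```python
from collections import Counter


def binned_differences(variants, reference, accession, bin_size):
    """Count differing sites per (chromosome, bin index).

    variants: iterable of (chrom, pos, genotypes), where genotypes maps an
    accession name to a genotype string such as "0/0" (hypothetical format).
    Sites where either genotype is missing (None or absent) are skipped.
    """
    counts = Counter()
    for chrom, pos, genotypes in variants:
        ref_gt = genotypes.get(reference)
        acc_gt = genotypes.get(accession)
        if ref_gt is None or acc_gt is None:
            continue  # no call for one of the two accessions
        if ref_gt != acc_gt:
            counts[(chrom, pos // bin_size)] += 1
    return counts


# Toy data: accession "A" acts as the user-selected reference.
variants = [
    ("Chr01", 120, {"A": "0/0", "B": "1/1"}),
    ("Chr01", 480, {"A": "0/0", "B": "0/0"}),
    ("Chr01", 510, {"A": "1/1", "B": "0/0"}),
    ("Chr01", 1700, {"A": "0/0", "B": None}),
    ("Chr02", 90, {"A": "0/1", "B": "1/1"}),
]
diffs = binned_differences(variants, "A", "B", bin_size=1000)
for (chrom, bin_index), n in sorted(diffs.items()):
    print(chrom, bin_index, n)
```

Comparing genotype strings directly treats "0/1" and "1/0" as different; a real tool would likely normalize allele order and phasing before comparing.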

Highlights

Large genotyping datasets have become commonplace due to efficient, cheap methods for SNP identification
In this paper we describe a new tool, GCViT (Genotype Comparison Visualization Tool), for dynamic, whole-genome visualization of resequencing and SNP array data through histogram, heatmap, or haplotype views of two or more accessions selected from a genotyping dataset
Instructions for deploying an instance of GCViT are provided in the GitHub repository

Summary

Introduction

Large genotyping datasets have become commonplace due to efficient, cheap methods for SNP identification. Re-sequencing and SNP-array projects are used to identify sequence variants between multiple lines, and may be used to perform genome-wide association studies (GWAS) to find variants that are associated with phenotypes. These studies can produce millions of SNPs. For example, Torkamaneh et al. [1]. The command-line tool Genotype Query Tools (GQT) [2] and its web form, webGQT [3], provide a means of indexing and querying VCF files. Some of these tools include:

Wilkey et al. BMC Genomics (2020) 21:822

Results

Discussion

Conclusion


Journal: BMC Genomics	Publication Date: Nov 23, 2020
Citations: 4	License type: open-access


Similar Papers

Deducing genotypes for loci of interest from SNP array data via haplotype sharing, demonstrated for apple and cherry.
Alexander Schaller ... Cameron Peace
PLOS ONE | VOL. 18
07 Feb 2023

The genomic basis of parallel ecological speciation
Frederico Roda
30 Jan 2015

Divergence hitchhiking and the spread of genomic isolation during ecological speciation-with-gene-flow
Sara Via
Philosophical Transactions of the Royal Society B: Biological Sciences | VOL. 367
05 Feb 2012

Chromosomal inversions associated with environmental adaptation in honeybees.
Matthew J Christmas ... Anna Olsson
Molecular Ecology | VOL. 28
21 Dec 2018
