DivStat: a user-friendly tool for single nucleotide polymorphism analysis of genomic diversity.

Inês Soares, Ana Moleirinho, Gonçalo N P Oliveira, António Amorim

doi:10.1371/journal.pone.0119851

Abstract

Recent developments have led to an enormous increase in publicly available large genomic data sets, including complete genomes. The 1000 Genomes Project was a major contributor, releasing the results of sequencing a large number of individual genomes and enabling a myriad of large-scale studies of human genetic variation. However, the tools currently available are insufficient for some analyses of data sets encompassing more than hundreds of base pairs, and for analyses of haplotype sequences of single nucleotide polymorphisms (SNPs). Here, we present a new and potent tool for large data sets that computes a variety of summary statistics of population genetic data and increases the speed of data analysis.

Highlights

The most widely used software packages, such as DnaSP [1] and Arlequin [2], cannot handle the data formats adopted by massive re-sequencing projects.
It became imperative to develop potent tools for analyzing genetic variation in large-scale data stored in the Variant Call Format (VCF), which was developed by the 1000 Genomes Project and has been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project [3,4,5].
The software described here is a new tool for efficiently using DNA sequences and polymorphism data, such as those recently released in the VCF format.

Summary

Introduction

The most widely used software packages, such as DnaSP [1] and Arlequin [2], cannot handle the data formats adopted by massive re-sequencing projects. We have developed a new and robust algorithm, implemented in the DivStat software, which runs in Linux/Unix, Macintosh and Windows environments and reduces the learning curve for users less familiar with shell commands. The program is implemented with a command-line shell and with a user-friendly graphical interface that facilitates use of the algorithm. This tool can be applied to either polymorphism data or DNA sequences. It can sequentially compute a variety of summary statistics of population genetic data over a "sliding window". The window is slid across the surveyed area and new similar
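
The sliding-window idea described above can be illustrated with a minimal Python sketch. This is not DivStat's algorithm or interface, which the text here does not specify. The function names, the toy SNP positions, the 0/1 haplotype strings, and the choice of statistics (number of segregating sites and mean pairwise differences) are all assumptions made for illustration.

```python
from itertools import combinations


def sliding_windows(start, end, size, step):
    """Yield half-open (lo, hi) windows over [start, end); the last ones may be shorter."""
    lo = start
    while lo < end:
        yield lo, min(lo + size, end)
        lo += step


def window_stats(positions, haplotypes, start, end, size, step):
    """Return (lo, hi, S, pi) per window.

    S  = number of segregating sites in the window.
    pi = mean number of pairwise differences between haplotypes in the window.
    """
    n = len(haplotypes)
    n_pairs = n * (n - 1) // 2
    results = []
    for lo, hi in sliding_windows(start, end, size, step):
        # Indices of the SNPs whose coordinate falls inside this window.
        cols = [i for i, p in enumerate(positions) if lo <= p < hi]
        # A site is segregating if more than one allele is observed at it.
        seg = sum(1 for i in cols if len({h[i] for h in haplotypes}) > 1)
        # Total differences summed over all unordered haplotype pairs.
        diffs = sum(
            sum(1 for i in cols if a[i] != b[i])
            for a, b in combinations(haplotypes, 2)
        )
        pi = diffs / n_pairs if n_pairs else 0.0
        results.append((lo, hi, seg, pi))
    return results


# Hypothetical toy data: four SNP coordinates and four haplotypes (0 = ref, 1 = alt).
positions = [100, 250, 400, 900]
haplotypes = ["0101", "0011", "0001", "1001"]

stats = window_stats(positions, haplotypes, start=0, end=1000, size=500, step=250)
for lo, hi, seg, pi in stats:
    print(f"[{lo}, {hi}): S={seg}, pi={pi:.2f}")
```

With a step smaller than the window size, consecutive windows overlap, so each SNP contributes to several windows. The sketch rescans all positions for every window for clarity; a tool intended for genome-scale data would likely index the positions instead.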

Journal: PLoS ONE
Publication Date: Mar 10, 2015
Citations: 12
License type: CC BY 4.0
