Kmer-SSR: a fast and exhaustive SSR search algorithm.

Brandon D Pickett,Perry G Ridge,Justin B Miller

doi:10.1093/bioinformatics/btx538

Abstract

MotivationOne of the main challenges with bioinformatics software is that the size and complexity of datasets necessitate trading speed for accuracy, or completeness. To combat this problem of computational complexity, a plethora of heuristic algorithms have arisen that report a ‘good enough’ solution to biological questions. However, in instances such as Simple Sequence Repeats (SSRs), a ‘good enough’ solution may not accurately portray results in population genetics, phylogenetics and forensics, which require accurate SSRs to calculate intra- and inter-species interactions.ResultsWe present Kmer-SSR, which finds all SSRs faster than most heuristic SSR identification algorithms in a parallelized, easy-to-use manner. The exhaustive Kmer-SSR option has 100% precision and 100% recall and accurately identifies every SSR of any specified length. To identify more biologically pertinent SSRs, we also developed several filters that allow users to easily view a subset of SSRs based on user input. Kmer-SSR, coupled with the filter options, accurately and intuitively identifies SSRs quickly and in a more user-friendly manner than any other SSR identification algorithm.Availability and implementationThe source code is freely available on GitHub at https://github.com/ridgelab/Kmer-SSR.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Bioinformatics	Publication Date: Aug 30, 2017
Citations: 22	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Kmer-SSR: a fast and exhaustive SSR search algorithm.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Similar Papers

RISA: a new web-tool for Rapid Identification of SSRs and Analysis of primers
Jungeun Kim ... Sang-Keun Oh
Genes & Genomics | VOL. 34
Jungeun Kim, et. al.Jungeun Kim ... Sang-Keun Oh
01 Dec 2012
Genes & Genomics | VOL. 34

Multiresolution descriptor matching algorithm for fast exhaustive search in norm-sorted databases
Jong Beom Ra
Journal of Electronic Imaging | VOL. 14
Jong Beom RaJong Beom Ra
01 Oct 2005
Journal of Electronic Imaging | VOL. 14

Genetic diversity and population structure of Distylium chinense revealed by ISSR and SRAP analysis in the Three Gorges Reservoir Region of the Yangtze River, China
Ling Xiang ... Cheng-Ming Huang
Global Ecology and Conservation | VOL. 21
Ling Xiang, et. al.Ling Xiang ... Cheng-Ming Huang
17 Oct 2019
Global Ecology and Conservation | VOL. 21

Assessment of genetic diversity and population structure in gladiolus (Gladiolus hybridus Hort.) by ISSR markers.
Veena Chaudhary ... Shailendra Sharma
Physiology and Molecular Biology of Plants | VOL. 24
Veena Chaudhary, et. al.Veena Chaudhary ... Shailendra Sharma
07 Apr 2018
Physiology and Molecular Biology of Plants | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kmer-SSR: a fast and exhaustive SSR search algorithm.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics