Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads

Gerton Lunter,Martin Goodson

doi:10.1101/gr.111120.110

Abstract

High-volume sequencing of DNA and RNA is now within reach of any research laboratory and is quickly becoming established as a key research tool. In many workflows, each of the short sequences ("reads") resulting from a sequencing run are first "mapped" (aligned) to a reference sequence to infer the read from which the genomic location derived, a challenging task because of the high data volumes and often large genomes. Existing read mapping software excel in either speed (e.g., BWA, Bowtie, ELAND) or sensitivity (e.g., Novoalign), but not in both. In addition, performance often deteriorates in the presence of sequence variation, particularly so for short insertions and deletions (indels). Here, we present a read mapper, Stampy, which uses a hybrid mapping algorithm and a detailed statistical model to achieve both speed and sensitivity, particularly when reads include sequence variation. This results in a higher useable sequence yield and improved accuracy compared to that of existing software.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads

Abstract

Talk to us

Similar Papers

More From: Genome Research

Lead the way for us

Journal: Genome Research	Publication Date: Oct 27, 2010
Citations: 1137

Similar Papers

Atm sequence variants are predictive of adverse radiotherapy response among patients treated for prostate cancer
J.A Cesaretti ... B.S Rosenstein
International Journal of Radiation Oncology*Biology*Physics | VOL. 60
J.A Cesaretti, et. al.J.A Cesaretti ... B.S Rosenstein
01 Sep 2004
International Journal of Radiation Oncology*Biology*Physics | VOL. 60

Sequence variation and the transcriptional activity of the upstream regulatory region in human papillomavirus 16 E7 variants in cervical cancer of Korean women
Yong Kim ... Yong Song
Oncology Reports | VOL. 14
Yong Kim, et. al.Yong Kim ... Yong Song
01 Aug 2005
Oncology Reports | VOL. 14

HPV-16 E2 gene disruption and sequence variation in CIN 3 lesions and invasive squamous cell carcinomas of the cervix: relation to numerical chromosome abnormalities
D A Graham
Molecular Pathology | VOL. 53
D A GrahamD A Graham
01 Aug 2000
Molecular Pathology | VOL. 53

Sequence variations, flanking region mutations, and allele frequency at 31 autosomal STRs in the central Indian population by next generation sequencing (NGS)
Hirak Ranjan Dash ... Surajit Das
Scientific Reports | VOL. 11
Hirak Ranjan Dash, et. al.Hirak Ranjan Dash ... Surajit Das
01 Dec 2021
Scientific Reports | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads

Abstract

Talk to us

Similar Papers

More From: Genome Research