Abstract

BackgroundTransposable elements are major players in genome evolution. Transposon insertion polymorphisms can translate into phenotypic differences in plants and animals and are linked to different diseases including human cancer, making their characterization highly relevant to the study of genome evolution and genetic diseases.ResultsHere we present Jitterbug, a novel tool that identifies transposable element insertion sites at single-nucleotide resolution based on the pairedend mapping and clipped-read signatures produced by NGS alignments. Jitterbug can be easily integrated into existing NGS analysis pipelines, using the standard BAM format produced by frequently applied alignment tools (e.g. bwa, bowtie2), with no need to realign reads to a set of consensus transposon sequences. Jitterbug is highly sensitive and able to recall transposon insertions with a very high specificity, as demonstrated by benchmarks in the human and Arabidopsis genomes, and validation using long PacBio reads. In addition, Jitterbug estimates the zygosity of transposon insertions with high accuracy and can also identify somatic insertions.ConclusionsWe demonstrate that Jitterbug can identify mosaic somatic transposon movement using sequenced tumor-normal sample pairs and allows for estimating the cancer cell fraction of clones containing a somatic TE insertion. We suggest that the independent methods we use to evaluate performance are a step towards creating a gold standard dataset for benchmarking structural variant prediction tools.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1975-5) contains supplementary material, which is available to authorized users.

Highlights

  • Transposable elements are major players in genome evolution

  • Increased Transposable Element (TE) activity in neurons may lead to diseases such as schizophrenia [13] and TE insertion (TEI) have been associated to other human diseases such as hepatocellular carcinoma [14], lung squamous, head and neck, colorectal and endometrial carcinomas [15], as well as to other cancer types [16]

  • Jitterbug allows for identification of TEIs with an allelic frequency (AF) substantially below 0.5, such as the case of somatic mutations occurring in subclones of a tumor and present in cancer cell fractions below 100 %

Read more

Summary

Introduction

Transposon insertion polymorphisms can translate into phenotypic differences in plants and animals and are linked to different diseases including human cancer, making their characterization highly relevant to the study of genome evolution and genetic diseases. Transposable elements (TEs) are mobile genetic elements that account for an important fraction of both plant and animal genomes. The analysis of TE insertion polymorphisms is an important component in studying the evolution of plant and animal genomes and is highly relevant in the context of elucidating the genetic basis of disease, including cancers. The identification of somatic TEIs in cancer is highly relevant to the study of disease evolution, and remains a challenge as it requires highly sensitive methods able to identify TEIs in a minor fraction of cells (i.e. tumor sub-clones)

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call