Abstract

BackgroundA critical topic of insertional mutagenesis experiments performed on model organisms is mapping the hits of artificial transposons (ATs) at nucleotide level accuracy. Mapping errors may occur when sequencing artifacts or mutations as single nucleotide polymorphisms (SNPs) and small indels are present very close to the junction between a genomic sequence and a transposon inverted repeat (TIR). Another particular item of insertional mutagenesis is mapping of the transposon self-insertions and, to our best knowledge, there is no publicly available mapping tool designed to analyze such molecular events.ResultsWe developed Genome ARTIST, a pairwise gapped aligner tool which works out both issues by means of an original, robust mapping strategy. Genome ARTIST is not designed to use next-generation sequencing (NGS) data but to analyze ATs insertions obtained in small to medium-scale mutagenesis experiments. Genome ARTIST employs a heuristic approach to find DNA sequence similarities and harnesses a multi-step implementation of a Smith-Waterman adapted algorithm to compute the mapping alignments. The experience is enhanced by easily customizable parameters and a user-friendly interface that describes the genomic landscape surrounding the insertion. Genome ARTIST is functional with many genomes of bacteria and eukaryotes available in Ensembl and GenBank repositories. Our tool specifically harnesses the sequence annotation data provided by FlyBase for Drosophila melanogaster (the fruit fly), which enables mapping of insertions relative to various genomic features such as natural transposons. Genome ARTIST was tested against other alignment tools using relevant query sequences derived from the D. melanogaster and Mus musculus (mouse) genomes. Real and simulated query sequences were also comparatively inquired, revealing that Genome ARTIST is a very robust solution for mapping transposon insertions.ConclusionsGenome ARTIST is a stand-alone user-friendly application, designed for high-accuracy mapping of transposon insertions and self-insertions. The tool is also useful for routine aligning assessments like detection of SNPs or checking the specificity of primers and probes. Genome ARTIST is an open source software and is available for download at www.genomeartist.ro and at GitHub (https://github.com/genomeartist/genomeartist ).Electronic supplementary materialThe online version of this article (doi:10.1186/s13100-016-0061-0) contains supplementary material, which is available to authorized users.

Highlights

  • A critical topic of insertional mutagenesis experiments performed on model organisms is mapping the hits of artificial transposons (ATs) at nucleotide level accuracy

  • A second rule is that if the nucleus of a transposon partial alignment (TPA) happens to overlap the nucleus of a genomic partial alignment (GPA), the shared nucleus fragment is allotted to the transposon in the final mapping result

  • The general performances of Genome ARTIST were tested with 39 original sequences derived by inverse PCR (iPCR) inquiry of D. melanogaster mutant strains obtained in our laboratory by mobilization of P{lacW} and P{EP} artificial transposons with a Δ2-3 transposase source [19]

Read more

Summary

Results

The general performances of Genome ARTIST were tested with 39 original sequences derived by iPCR inquiry of D. melanogaster mutant strains obtained in our laboratory by mobilization of P{lacW} and P{EP} artificial transposons with a Δ2-3 transposase source [19]. Within the table the numerical values reveal the accuracy of the mapping results (precise mappings) reported by Genome ARTIST and iMapper for various patterns of simulated small-scale deletions and SNPs in the TSD region from six sets of 102 query sequences standing for virtual P{lacW} insertions considering the same simulations It appears that SSAHA, BLAST and BLAT aligners fail to accurately map the genomic sequences containing terminal smallscale mutations if the transposon sequences are removed from the composite query. We simulated genomic small-scale mutations very close to the TIRs of 23 real and of 102 virtual P{lacW} insertions located in D. melanogaster genome This approach was intended to comparatively test the robustness of Genome ARTIST to map ATs insertions when affected by polymorphisms and/ or by sequencing artifacts as compared to the similar achievements of iMapper, BLAST and BLAT. Genome ARTIST is an open-source software which runs on many flavors of Linux OS and perfectly fits the popular BioLinux workbench

Conclusions
Background
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.