Abstract

Background: Ever since the development of first next-generation genome sequencer (NGS) in 2005, there are rapid developments of high throughput next-generation genome sequencing (HT-NGS) techniques and tools used in genetics and genomics has become much more comfortable and cheaper. The result is the generation of a massive amount of data sets, requiring detailed analysis, which becomes impossible without the use of appropriate bioinformatics tools. One of the crucial steps in the analysis of NGS data is to map readings to a reference sequence. Although the dominance of Illumina synthesis by sequencing (SBS) technology has been noticeable in recent years, the choice of the tools is hampered and the variety of input data and reference genomes. Moreover, the tools used are crucial for result files and further analysis.Methods: The subject of this paper is the three most frequently used alignment mapping programs, which have functions to allow working with many platforms: BWA, Bowtie2 and SMALT. The task of the tested aligners is to match short sequences coming from NGS with reference sequences. The most popular: BWA and Bowtie2 use for this purpose the Burrows-Wheeler transformation and SMALT maps the sequences using hashing and dynamic programming. The presented paper aimed to compare the quality and efficiency of the alignment mapping programs under examination, due to three criteria: i) the quality of the compared sequences of different lengths and from different platforms; ii) coefficient of wrongly compared sequences; iii) the computational resources used.Results: By comparing the results of the mapping analyses for all the programs used, the least popular SMALT is the best. Obtaining the highest percentage of mapped readings for each platform and maintaining the lowest computational memory usage, turns out to be the most optimal choice.Conclusions: The results presented in this paper can be used to verify and rebuild data analysis pipelines from NGS based so far on other tools. We conclude that by using the tools under appropriate conditions, it is possible to improve the quality of the analyses, speed them up and reduce their cost.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call