Task Of Parallel Programming Research Articles

The use of parallel computing tools can significantly reduce the execution time of calculations in many engineering tasks. One of the main difficulties in the development of multithreaded programs remains the organization of simultaneous access from different threads to shared data. The most common solution to this problem is to use locking facilities when accessing shared data. There are a number of tasks where data sharing is not needed, but you need to synchronize access to a limited resource, such as a temporary buffer. In such tasks, there is no data exchange between different threads, but there is an object that at a given time can be used by the code of only one thread. One such task is calculating the value of a B-spline. The software implementation of the functions for calculating B-splines, performed according to classical algorithms, requires the use of blocking objects when accessing the common array of intermediate data from different threads. This reduces the degree of parallelism and reduces the efficiency of computational programs using B-splines running on multiprocessor computing systems. The article discusses a way to improve the efficiency of calculating B-splines in parallel programming tasks by eliminating locks when accessing general modified data. A software implementation is presented in the form of a C++ class template, which provides placement of a temporary array used for calculating a B-spline into a local buffer of a given size with the possibility of increasing it if necessary. Using the developed template in conjunction with the threadlocal qualifier reduces the number of requests for increasing the buffer for high degree B-splines (larger than the initially specified buffer size). It is also possible to implement this scheme using the std::vector template of the C++ STL Standard Library. The results of the application of the developed class when calculating the values of B-splines in a multithreaded environment, showing a reduction in the calculation time in proportion to an increase in the number of computational processors, are presented. The methods of specifying arrays for storing intermediate calculation results considered in this article can be used in other parallel programming tasks.

Bioinformatics is an emerging field, where information technology usage can significantly accelerate life science research. It is a relatively new field and the scope of exploring new tools and techniques seems immense. One major field where bioinformatics plays important role is next generation sequence analysis (NGS), in which an unknown genome is shuttered into pieces and tried to align it to a reference known genome to decipher its functions using sequence comparison. The first well known application of this technology is the human genome project which took nearly 10 years to finish. With advancements in central processing units (CPUs), the alignment time has improved, but has not reached optimal. There seems a constant need to improve this computing time, which made the scope for using graphics processing units (GPUs) and parallel programming tasks to replace CPUs. With access to high performance multi-thread, multi-core parallel computing supercomputers, several GPU based sequence alignment tools have been published recently, some of the major tools are BarraCUDA, CUSHAW, GPU-BWT, SOAP3, and SARUMAN, which claim to speed up the processes anywhere between 2x and 10x times. Most of these tools can be compiled on GCC 4.3 compilers with CUDA. This paper focuses on compiling the current GPU based alignment tools on 70.7 million read pairs (Illumina HiSeq 2000) to align them on a human genome and check its efficiency (time sensitivity and alignment specificity) compared to traditional CPU based alignment (Bowtie) tool. Resulting observations would help researchers choose the appropriate GPU alignment tool to suffice their computing needs. Key words: CUDA, sequencing, alignment, graphics processing units (GPUs), central processing units (CPUs).

Task Of Parallel Programming Research Articles

Related Topics

Articles published on Task Of Parallel Programming

Performance and programmability of GrPPI for parallel stream processing on multi-cores

Improving the Efficiency of Calculating B-splines in parallel Programming Tasks

A Comparison of Scheduling parallel program tasks based on Java Applet

Using Thread Local Memory for Calculating B-Splines in Parallel Programming Tasks

English

Parallel programming with Easy Java Simulations

Program Transformation to Identify List-Based Parallel Skeletons

SpiceC

Predicting the execution times of parallel-independent programs using Pearson distributions

Main sequences genetic scheduling for multiprocessor systems using task duplication

Scheduling tasks of a parallel program in two-processor systems with use of cellular automata

Genetics-based multiprocessor scheduling using task duplication

A Parallel Matrix Class Library in C++ for Computational Mechanics Applications

A message-passing class library C++ for portable parallel programming

Object-oriented parallel programming tools for structural engineering applications

A quasi-optimal cluster allocation strategy for parallel program execution in distributed systems using genetic algorithms

Scheduling of precedence-constrained parallel program tasks on multiprocessors

Translating from FP to Occam for systolic algorithms

Scheduling parallel program tasks onto arbitrary target machines

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Task Of Parallel Programming Research Articles

Related Topics

Articles published on Task Of Parallel Programming

Performance and programmability of GrPPI for parallel stream processing on multi-cores

Improving the Efficiency of Calculating B-splines in parallel Programming Tasks

A Comparison of Scheduling parallel program tasks based on Java Applet

Using Thread Local Memory for Calculating B-Splines in Parallel Programming Tasks

English

Parallel programming with Easy Java Simulations

Program Transformation to Identify List-Based Parallel Skeletons

SpiceC

Predicting the execution times of parallel-independent programs using Pearson distributions

Main sequences genetic scheduling for multiprocessor systems using task duplication

Scheduling tasks of a parallel program in two-processor systems with use of cellular automata

Genetics-based multiprocessor scheduling using task duplication

A Parallel Matrix Class Library in C++ for Computational Mechanics Applications

A message-passing class library C++ for portable parallel programming

Object-oriented parallel programming tools for structural engineering applications

A quasi-optimal cluster allocation strategy for parallel program execution in distributed systems using genetic algorithms

Scheduling of precedence-constrained parallel program tasks on multiprocessors

Translating from FP to Occam for systolic algorithms

Scheduling parallel program tasks onto arbitrary target machines