The advent of exascale supercomputers heralds a new era of scientific discovery, yet it introduces significant architectural challenges that MPI applications must overcome to fully exploit their potential. Among these challenges is the adoption of heterogeneous architectures, particularly the integration of GPUs to accelerate computation. The complexity of multithreaded programming models has likewise become a critical factor in achieving performance at scale, and efficient use of the communication acceleration provided by modern NICs is essential for low-latency, high-throughput communication in such complex systems. In response to these challenges, the MPICH library, a high-performance and widely used implementation of the Message Passing Interface (MPI), has undergone significant enhancements. This paper presents four major contributions that prepare MPICH for the exascale transition. First, we describe a lightweight communication stack that leverages the advanced features of modern NICs to maximize hardware acceleration. Second, we showcase a highly scalable multithreaded communication model that addresses the complexities of concurrent environments. Third, we introduce GPU-aware communication capabilities that optimize data movement in GPU-integrated systems. Finally, we present a new datatype engine that accelerates the use of MPI derived datatypes on GPUs. These improvements not only address the immediate needs of exascale computing architectures but also lay a foundation for exploiting future innovations in high-performance computing. By embracing these new designs and approaches, the MPICH-derived libraries from HPE Cray and Intel were able to achieve exascale performance on OLCF Frontier and ALCF Aurora, respectively.