Modern Multi-core Systems Research Articles

The Bellman operator constitutes the foundation of dynamic programming (DP). An alternative is presented by the Gauss-Seidel operator, whose evaluation, differently from that of the Bellman operator where the states are all processed at once, updates one state at a time, while incorporating into the computation the interim results. The provably better convergence rate of DP methods based on the Gauss-Seidel operator comes at the price of an inherent sequentiality, which prevents the exploitation of modern multi-core systems. In this work we propose a new operator for dynamic programming, namely, the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">mini-batch Bellman operator</i> , which aims at realizing the trade-off between the better convergence rate of the methods based on the Gauss-Seidel operator and the parallelization capability offered by the Bellman operator. After the introduction of the new operator, a theoretical analysis for validating its fundamental properties is conducted. Such properties allow one to successfully deploy the new operator in the main dynamic programming schemes, such as value iteration and modified policy iteration. We compare the convergence of the DP algorithm based on the new operator with its earlier counterparts, shedding light on the algorithmic advantages of the new formulation and the impact of the batch-size parameter on the convergence. Finally, an extensive numerical evaluation of the newly introduced operator is conducted. In accordance with the theoretical derivations, the numerical results show the competitive performance of the proposed operator and its superior flexibility, which allows one to adapt the efficiency of its iterations to different structures of MDPs and hardware setups.

Modern multi-core systems are most effective when used in large server centers and for cloud computing. However, despite the known complexity of software implemen-tation, parallel computing on multiprocessors is increasingly used in computer model-ling. Advanced mechanisms of synchronous and multithreaded programming are in-creasingly used to improve the productivity of numerical studies, reducing the time of computer models implementation. One such mechanism is coroutines, a convenient tool for managing asynchronous operations introduced in the C++20 standard. A special feature of coroutines is the ability to suspend a function at a certain stage, saving its state, and after some time resume its execution from the previous stop. The aim of this research is to improve the performance of computer modelling by using coroutines and data threads. As a result of the work, a test algorithm for multiplying a matrix by a vector and its modified asynchronous version using the coroutine mechanism and splitting into two data threads was developed, which allowed to achieve 1.94 times increase in the com-puting speed when the matrix dimension is 15000 (2.25×106 elements). It has been found that at a small matrix dimension, the developed asynchronous algorithm using coroutines and splitting into two threads is less efficient than the single thread algo-rithm. This is due to the fact that the compiler needs some time to create threads and start execution simultaneously. With a large dimensionality, the performance of the asynchronous algorithm increases significantly. With a matrix dimension of more than 1200, the use of an asynchronous algorithm divided into two threads is guaranteed to be more efficient than a single-threaded. The data obtained are consistent with the results of similar studies of the problem of increasing the efficiency of computer modelling using alternative software and hard-ware. The new method of solving the problems of asynchronous programming provides a more efficient and simple mechanism for managing asynchronous operations.

Modern Multi-core Systems Research Articles

Related Topics

Articles published on Modern Multi-core Systems

Special edition on resource partitioning for modern multicore systems

FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration

Fast, parallel, and cache-friendly suffix array construction

FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration

ConsICA: an R package for robust reference-free deconvolution of multi-omics data.

Parallel and Flexible Dynamic Programming via the Mini-Batch Bellman Operator

RabbitQCPlus 2.0: More efficient and versatile quality control for sequencing data.

Evaluation of the efficiency of implementation of asynchronous computing algorithms using coroutines and threads in С++

RabbitFX: Efficient Framework for FASTA/Q File Parsing on Modern Multi-Core Platforms.

Scalable and Robust Snapshot Isolation for High-Performance Storage Engines

A security-aware hardware scheduler for modern multi-core systems with hard real-time constraints

Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach

Accelerated K-Means Algorithms for Low-Dimensional Data on Parallel Shared-Memory Systems

Multi-core processors use for numerical problems solutions

TARTS: A Temperature-Aware Real-Time Deadline-Partitioned Fair Scheduler

Evaluating the performance of atomic operations on modern multicore systems

A Multicore Chip Load Model for PDN Analysis Considering Voltage–Current-Timing Interdependency and Operation Mode Transitions

The time and energy efficiency of modern multicore systems

Maximizing Performance under a Power Constraint on Modern Multicore Systems

Scalable Light-Weight Integration of FPGA Based Accelerators with Chip Multi-Processors

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Modern Multi-core Systems Research Articles

Related Topics

Articles published on Modern Multi-core Systems

Special edition on resource partitioning for modern multicore systems

FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration

Fast, parallel, and cache-friendly suffix array construction

FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration

ConsICA: an R package for robust reference-free deconvolution of multi-omics data.

Parallel and Flexible Dynamic Programming via the Mini-Batch Bellman Operator

RabbitQCPlus 2.0: More efficient and versatile quality control for sequencing data.

Evaluation of the efficiency of implementation of asynchronous computing algorithms using coroutines and threads in С++

RabbitFX: Efficient Framework for FASTA/Q File Parsing on Modern Multi-Core Platforms.

Scalable and Robust Snapshot Isolation for High-Performance Storage Engines

A security-aware hardware scheduler for modern multi-core systems with hard real-time constraints

Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach

Accelerated K-Means Algorithms for Low-Dimensional Data on Parallel Shared-Memory Systems

Multi-core processors use for numerical problems solutions

TARTS: A Temperature-Aware Real-Time Deadline-Partitioned Fair Scheduler

Evaluating the performance of atomic operations on modern multicore systems

A Multicore Chip Load Model for PDN Analysis Considering Voltage–Current-Timing Interdependency and Operation Mode Transitions

The time and energy efficiency of modern multicore systems

Maximizing Performance under a Power Constraint on Modern Multicore Systems

Scalable Light-Weight Integration of FPGA Based Accelerators with Chip Multi-Processors