Development Of Parallel Applications Research Articles

Monte Carlo transport calculations for fusion reactors are very challenging due to factors such as large radiation decay gradients, complex geometric structures, and strong neutron anisotropy. For such deep penetration shielding calculation problems, in order to obtain accurate global flux distribution, variance reduction methods and parallel computing are necessary means. When using the global weight window variance reduction technique, the huge deviation of the particle arrival weight and the weight window can lead to an extreme number of splits, which is the long history problem. If a process encounters the long history problem in parallel computing and takes a lot of time to complete, other processes will stop computing because the histories allocated in the computing cycle have been completed, seriously degrading computing efficiency. In addition, the calculation of the shutdown dose rate, a key issue in fusion neutronics calculations, requires a high “space-energy” resolution, which brings huge challenges to computational memory, especially in parallel computing. In order to overcome the degradation of parallel efficiency caused by the long history problem and the memory limitation in large-scale shutdown dose rate calculation, a hybrid parallel algorithm based on shared memory was researched. Finally, the hybrid parallel architecture was built in the Monte Carlo code cosRMC to realize dynamic load balancing between threads. Through the memory sharing among all threads of a single node, the pressure of memory consumption can be effectively relieved. In order to verify the effectiveness of the hybrid parallel algorithm developed in this paper, it was applied to the calculation of CFETR. The results showed that the hybrid parallel algorithm can effectively alleviate the impact of the long history problem on the parallel efficiency. In terms of saving memory, the hybrid parallel algorithm also showed a significant effect, which provides a guarantee for large-scale shutdown dose rate calculation.

Read full abstract

In this paper, the isoefficiency of MPP systems and heterogeneous CPU-GPU systems on the problem of discrete Fourier transform is considered. The development of parallel applications as its goal can not only reduce execution time, but also provide opportunities to solve problems of a larger dimension. The peculiarity of algorithm parallelization includes the efficient use of hardware while increasing the dimension of the problem is an important characteristic of parallel computing. However, currently heterogeneous systems have not been researched extensively to determine isoefficiency characteristics and build application-specific systems around said method, although there are articles that show potential using isoefficiency to design the system and using heterogeneous approach to accelerate performance of different tasks. Discrete Fourier Transform algorithm lets build systems that discretize analogue and digital signals and it can serve as a benchmark to test different systems. Algorithms suited for MPP systems can use analytical approach to find out issoefficiency function and to determine how scaling the system or changing the size of the task will change its performance metrics. One of the most popular approaches to linking up processing units in MPP systems is using hypercube topology. MPP system that is connected using this topology will be analyzed. CPU-GPU heterogeneous system will be analyzed using an approach based on polynomial regression. Due to the nature of heterogeneous systems, analytic approach used in MPP system is impossible. Predictive model based on polynomial regression will use modelling results from using CPU and GPU separately to estimate how much time it will take for heterogeneous system to finish the task. To ensure accuracy of the experiment, several systems will be used to model the task. Using this approach, resulting issoefficient heterogeneous system will be analyzed using performance metrics s

Read full abstract

Development Of Parallel Applications Research Articles

Related Topics

Articles published on Development Of Parallel Applications

Boosting HPC data analysis performance with the ParSoDA-Py library

Development of Monte Carlo hybrid parallel algorithm and application in high resolution spatial-energy flux distribution calculation

A parallel programming assessment for stream processing applications on multi-core systems

Stencil Calculations with Algorithmic Skeletons for Heterogeneous Computing Environments

Evaluation of Intel's DPC++ Compatibility Tool in heterogeneous computing

СПОСІБ РОЗРОБКИ ІЗОЕФЕКТИВНОЇ ГЕТЕРОГЕННОЇ СИСТЕМИ НА ОСНОВІ МАШИННОГО НАВЧАННЯ ДЛЯ ЗАДАЧІ ДИСКРЕТНОГО ПЕРЕТВОРЕННЯ ФУР’Є

An MPI-based MPSoC Platform in FPGA

Performance Reduction for Automatic Development of Parallel Applications for Reconfigurable Computer Systems

A Novel, Highly Integrated Simulator for Parallel and Distributed Systems

Parallelisation of practical shared sampling alpha matting with OpenMP

Parallelisation of practical shared sampling alpha matting with OpenMP

On the code modernization of shared sampling alpha matting with OpenMP

Upgrading a high performance computing environment for massive data processing

DtCraft: A High-Performance Distributed Execution Engine at Scale

Impact study of data locality on task-based applications through the Heteroprio scheduler.

Transforming powerlist-based divide-and-conquer programs for an improved execution model

RDGC: A Reuse Distance-Based Approach to GPU Cache Performance Analysis

On parallelisation of image dehazing with OpenMP

A Very Fast Trace-Driven Simulation Platform for Chip-Multiprocessors Architectural Explorations

Algorithms for Balanced Graph Colorings with Applications in Parallel Computing

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Development Of Parallel Applications Research Articles

Related Topics

Articles published on Development Of Parallel Applications

Boosting HPC data analysis performance with the ParSoDA-Py library

Development of Monte Carlo hybrid parallel algorithm and application in high resolution spatial-energy flux distribution calculation

A parallel programming assessment for stream processing applications on multi-core systems

Stencil Calculations with Algorithmic Skeletons for Heterogeneous Computing Environments

Evaluation of Intel's DPC++ Compatibility Tool in heterogeneous computing

СПОСІБ РОЗРОБКИ ІЗОЕФЕКТИВНОЇ ГЕТЕРОГЕННОЇ СИСТЕМИ НА ОСНОВІ МАШИННОГО НАВЧАННЯ ДЛЯ ЗАДАЧІ ДИСКРЕТНОГО ПЕРЕТВОРЕННЯ ФУР’Є

An MPI-based MPSoC Platform in FPGA

Performance Reduction for Automatic Development of Parallel Applications for Reconfigurable Computer Systems

A Novel, Highly Integrated Simulator for Parallel and Distributed Systems

Parallelisation of practical shared sampling alpha matting with OpenMP

Parallelisation of practical shared sampling alpha matting with OpenMP

On the code modernization of shared sampling alpha matting with OpenMP

Upgrading a high performance computing environment for massive data processing

DtCraft: A High-Performance Distributed Execution Engine at Scale

Impact study of data locality on task-based applications through the Heteroprio scheduler.

Transforming powerlist-based divide-and-conquer programs for an improved execution model

RDGC: A Reuse Distance-Based Approach to GPU Cache Performance Analysis

On parallelisation of image dehazing with OpenMP

A Very Fast Trace-Driven Simulation Platform for Chip-Multiprocessors Architectural Explorations

Algorithms for Balanced Graph Colorings with Applications in Parallel Computing