Multi-core PC Research Articles

Image reconstruction in soft-field tomography is based on an inverse problem formulation, where a forward model is fitted to the data. In medical applications, where the anatomy presents complex shapes, it is common to use finite element models (FEMs) to represent the volume of interest and solve a partial differential equation that models the physics of the system. Over the last decade, there has been a shifting interest from 2D modeling to 3D modeling, as the underlying physics of most problems are 3D. Although the increased computational power of modern computers allows working with much larger FEM models, the computational time required to reconstruct 3D images on a fine 3D FEM model can be significant, on the order of hours. For example, in electrical impedance tomography (EIT) applications using a dense 3D FEM mesh with half a million elements, a single reconstruction iteration takes approximately 15–20 min with optimized routines running on a modern multi-core PC. It is desirable to accelerate image reconstruction to enable researchers to more easily and rapidly explore data and reconstruction parameters. Furthermore, providing high-speed reconstructions is essential for some promising clinical application of EIT. For 3D problems, 70% of the computing time is spent building the Jacobian matrix, and 25% of the time in forward solving. In this work, we focus on accelerating the Jacobian computation by using single and multiple GPUs. First, we discuss an optimized implementation on a modern multi-core PC architecture and show how computing time is bounded by the CPU-to-memory bandwidth; this factor limits the rate at which data can be fetched by the CPU. Gains associated with the use of multiple CPU cores are minimal, since data operands cannot be fetched fast enough to saturate the processing power of even a single CPU core. GPUs have much faster memory bandwidths compared to CPUs and better parallelism. We are able to obtain acceleration factors of 20 times on a single NVIDIA S1070 GPU, and of 50 times on four GPUs, bringing the Jacobian computing time for a fine 3D mesh from 12 min to 14 s. We regard this as an important step toward gaining interactive reconstruction times in 3D imaging, particularly when coupled in the future with acceleration of the forward problem. While we demonstrate results for EIT, these results apply to any soft-field imaging modality where the Jacobian matrix is computed with the adjoint method.

The LHCb Experiment is a hadronic precision experiment at the LHC accelerator aimed at mainly studying b-physics by profiting from the large b-anti-b-production at LHC. The challenge of high trigger efficiency has driven the choice of a readout architecture allowing the main event filtering to be performed by a software trigger with access to all detector information on a processing farm based on commercial multicore PCs. The readout architecture therefore features only a relatively relaxed hardware trigger with a fixed and short latency accepting events at 1 MHz out of a nominal proton collision rate of 30 MHz, and high bandwidth with event fragment assembly over Gigabit Ethernet. A fast central system performs the entire synchronization, event labelling and control of the readout, as well as event management including destination control, dynamic load balancing of the readout network and the farm, and handling of special events for calibrations and luminosity measurements. The event filter farm processes the events in parallel and reduces the physics event rate to about 2 kHz which are formatted and written to disk before transfer to the offline processing. A spy mechanism allows processing and reconstructing a fraction of the events for online quality checking. In addition a 5 Hz subset of the events are sent as express stream to offline for checking calibrations and software before launching the full offline processing on the main event stream. In this paper, we will give an overview of the readout system, and describe the real-time event management and the experience with the system during the commissioning phase with cosmic rays and first LHC beams.

Multi-core PC Research Articles

Related Topics

Articles published on Multi-core PC

High performance parallel $$k$$ k -means clustering for disk-resident datasets on multi-core CPUs

Parallel Dijkstra's Algorithm Based on Multi-Core and MPI

Development and performance analysis of a parallel Monte Carlo neutron transport simulation program for GPU-Cluster using MPI and CUDA technologies

Multi-GPU Jacobian accelerated computing for soft-field tomography

IMPLEMENTATION OF THE DISTRIBUTED PARALLEL PROGRAM FOR GEOID HEIGHTS COMPUTATION USING MPI AND OPENMP

Efficient system-enforced deterministic parallelism

A Parallel Differential Box-Counting Algorithm Applied to Hyperspectral Image Classification

Parallel processing for stepwise generalisation method on multi-core PC cluster

Development of High Level Trigger Software for Belle II at SuperKEKB

Optimizing image processing on multi-core CPUs with Intel parallel programming technologies

MetAlign 3.0: performance enhancement by efficient use of advances in computer hardware.

Cooperative and competitive concurrency in scientific computing. A full open-source upgrade of the program for dynamical calculations of RHEED intensity oscillations

Inline inspection of textured plastics surfaces

Portable and Simple Technique Using Multi Core and Multiple GbE Ports for Commodity PC Clusters

Research and implementation of parallel rendering system based on multi-core PC cluster

The LHCb Readout System and Real-Time Event Management

Fast and robust multi-atlas segmentation of brain magnetic resonance images

The Beowulf Analysis Symbolic INterface: Interactive Parallel Data Analysis for Everyone

Using hybrid MPI and OpenMP programming to optimize communications in parallel loop self-scheduling schemes for multicore PC clusters

マルチコアＰＣクラスタにおける電力系統シミュレーションの並列化手法の検討

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-core PC Research Articles

Related Topics

Articles published on Multi-core PC

High performance parallel $$k$$ k -means clustering for disk-resident datasets on multi-core CPUs

Parallel Dijkstra's Algorithm Based on Multi-Core and MPI

Development and performance analysis of a parallel Monte Carlo neutron transport simulation program for GPU-Cluster using MPI and CUDA technologies

Multi-GPU Jacobian accelerated computing for soft-field tomography

IMPLEMENTATION OF THE DISTRIBUTED PARALLEL PROGRAM FOR GEOID HEIGHTS COMPUTATION USING MPI AND OPENMP

Efficient system-enforced deterministic parallelism

A Parallel Differential Box-Counting Algorithm Applied to Hyperspectral Image Classification

Parallel processing for stepwise generalisation method on multi-core PC cluster

Development of High Level Trigger Software for Belle II at SuperKEKB

Optimizing image processing on multi-core CPUs with Intel parallel programming technologies

MetAlign 3.0: performance enhancement by efficient use of advances in computer hardware.

Cooperative and competitive concurrency in scientific computing. A full open-source upgrade of the program for dynamical calculations of RHEED intensity oscillations

Inline inspection of textured plastics surfaces

Portable and Simple Technique Using Multi Core and Multiple GbE Ports for Commodity PC Clusters

Research and implementation of parallel rendering system based on multi-core PC cluster

The LHCb Readout System and Real-Time Event Management

Fast and robust multi-atlas segmentation of brain magnetic resonance images

The Beowulf Analysis Symbolic INterface: Interactive Parallel Data Analysis for Everyone

Using hybrid MPI and OpenMP programming to optimize communications in parallel loop self-scheduling schemes for multicore PC clusters

マルチコアＰＣクラスタにおける電力系統シミュレーションの並列化手法の検討