Open MPI Research Articles

Purpose: The purpose of this paper is to evaluate how to use nodes in a cluster efficiently by studying the NAS Parallel Benchmarks (NASPB) on Intel Xeon and AMD Opteron dual CPU Linux clusters.

Design/methodology/approach: The performance results of the NASPB are presented both with one MPI process per node (1 ppn) and with two MPI processes per node (2 ppn). These benchmark results were analyzed by considering the impact of cache effects, code scalability, memory bandwidth within nodes, and the impact of MPI and the MPI communication network. Memory bandwidth was benchmarked using MPI versions of the Streams benchmarks. The impact of MPI and the MPI communication network was evaluated by benchmarking the performance of MPI sends and receives, MPI broadcast, and the MPI all‐to‐all routines.

Findings: The performance results from running the NASPB and from the memory bandwidth benchmarks show that better performance can sometimes be achieved using 1 ppn. Performance results show that the AMD Opteron/Myrinet cluster is able to achieve significantly better utilization of the second processor than the Intel Xeon/Myrinet cluster.

Practical implications: Most Linux clusters are purchased with two processors per node. One would like to run all applications on such a cluster using 2 ppn instead of 1 ppn in order to utilize the second processor on each node. However, our results show that this is not always the best choice. Users should always assess their program performance with both 1 ppn and 2 ppn before running production calculations. This issue becomes even more important with the emergence of multi‐core processors.

Originality/value: To the authors' best knowledge, this is the only detailed comparison of AMD Opteron and Intel Xeon dual processor node parallel performance on large Myrinet clusters. The paper should be of value to everybody considering running on or purchasing an AMD or Intel‐based Linux cluster.
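The advice to time a program with both 1 ppn and 2 ppn before production runs can be sketched as a small comparison helper. This is an illustrative sketch only: the function names, the gain and efficiency metrics, and the timings below are assumptions made for the example and are not taken from the paper. It assumes both runs use the same set of nodes, so the 2 ppn run has twice as many MPI processes.

```python
def second_processor_gain(t_1ppn: float, t_2ppn: float) -> float:
    """Speedup from enabling the second processor on the same set of nodes."""
    if t_1ppn <= 0 or t_2ppn <= 0:
        raise ValueError("timings must be positive")
    return t_1ppn / t_2ppn


def recommend_ppn(t_1ppn: float, t_2ppn: float) -> int:
    """Return 2 if the 2 ppn run finishes sooner on the same nodes, else 1."""
    return 2 if second_processor_gain(t_1ppn, t_2ppn) > 1.0 else 1


# Hypothetical wall-clock times in seconds (1 ppn, 2 ppn); not measured data.
timings = {
    "compute_bound_kernel": (100.0, 55.0),
    "bandwidth_bound_kernel": (100.0, 110.0),
}

for name, (t1, t2) in timings.items():
    gain = second_processor_gain(t1, t2)
    # A gain of 2.0 would mean the second processor is fully utilized.
    print(f"{name}: gain={gain:.2f}x, efficiency={gain / 2:.0%}, "
          f"use {recommend_ppn(t1, t2)} ppn")
```

A gain below 1.0 corresponds to the case the abstract describes, where 1 ppn gives better performance than 2 ppn.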

High performance computing on parallel architectures currently uses different approaches depending on the hardware memory model of the architecture, the abstraction level of the programming environment and the nature of the application. In this article, we introduce an original client–server execution model based on RPCs called out-of-order parallel virtual machine (OVM). OVM aims to provide three main features: portability through a unique memory model, load balancing through plug-in support, and high performance provided by several optimizations. The main optimizations are: non-blocking RPCs, data-flow management, persistent and non-persistent data, static data set distribution, dynamic scheduling and asynchronous global operations. We present the general architecture of OVM and demonstrate high performance for regular parallel applications, a parallel application with load-balancing needs and a parallel application with real-time constraints. We first compare the performance of OVM and MPI for three kernels of the NAS 2.3. Then we illustrate the performance capability of OVM for AIRES, a large real-life application that needs load-balancing support. Finally, we present the performance of a real-time version of the PovRay ray-tracer, demonstrating the reactiveness of OVM.
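Two of the listed optimizations, non-blocking calls and dynamic scheduling, can be illustrated in general terms with Python's standard `concurrent.futures` module. This is a generic sketch and not the OVM API: local threads stand in for remote servers, and `render_tile` and `run_nonblocking` are hypothetical names invented for the example.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def render_tile(tile_id: int) -> int:
    """Stand-in for a remote procedure; its cost grows with the input."""
    return sum(i * i for i in range(tile_id * 1000))


def run_nonblocking(tile_ids, workers=4):
    """Issue every call without waiting, then collect results as they finish."""
    results = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # submit() returns a future immediately, like a non-blocking call.
        futures = {pool.submit(render_tile, t): t for t in tile_ids}
        # Idle workers pick up the next pending call, a simple form of
        # dynamic scheduling; results arrive in completion order.
        for fut in as_completed(futures):
            results[futures[fut]] = fut.result()
    return results


results = run_nonblocking(range(8))
print(sorted(results))
```

Because the caller never blocks on an individual call, uneven task costs are absorbed by whichever worker becomes free first.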

Articles published on Open MPI

Parallel Option Price Valuations with the Explicit Finite Difference Method

De novo transcriptome assembly with ABySS

Test suite for evaluating performance of multithreaded MPI communication

Using benchmarking to determine efficient usage of nodes in a cluster

The Open Run-Time Environment (OpenRTE): A transparent multicluster environment for high-performance computing

Open MPI: A High Performance, Flexible Implementation of MPI Point-to-Point Communications

Construction of Hybrid MPI-OpenMP Solutions for SMP Clusters

Performance and scalability of MPI on PC clusters

OVM: Out-of-order execution parallel virtual machine

Scalability and performance of OpenMP and MPI on a 128‐processor SGI Origin 2000

Development of Mixed Mode MPI / OpenMP Applications

Computational chemistry on Fujitsu vector–parallel processors: Hardware and programming environment

High-energy physics software parallelization using database techniques
