A Performance Study of a Dual Xeon-Phi Cluster for the Forward Modelling of Gravitational Fields

Maricela Arroyo,Nain Vera-Chavez,Israel-Enrique Herrera-Diaz,Carlos Couder-Castañeda,Alfredo Trujillo-Alcantara

doi:10.1155/2015/316012

Abstract

With at least 60 processing cores, the Xeon-Phi coprocessor is a truly multicore architecture, which consists of an interconnection speed among cores of 240 GB/s, two levels of cache memory, a theoretical performance of 1.01 Tflops, and programming flexibility, all making the Xeon-Phi an excellent coprocessor for parallelizing applications that seek to reduce computational times. The objective of this work is to migrate a geophysical application designed to directly calculate the gravimetric tensor components and their derivatives and in this way research the performance of one and two Xeon-Phi coprocessors integrated on the same node and distributed in various nodes. This application allows the analysis of the design factors that drive good performance and compare the results against a conventional multicore CPU. This research shows an efficient strategy based on nested parallelism using OpenMP, a design that in its outer structure acts as a controller of interconnected Xeon-Phi coprocessors while its interior is used for parallelyzing the loops. MPI is subsequently used to reduce the information among the nodes of the cluster.

Highlights

The Xeon-Phi coprocessor is one of the new architectures designed for high performance computing (HPC)
This research shows an efficient strategy based on nested parallelism using OpenMP, a design that in its outer structure acts as a controller of interconnected Xeon-Phi coprocessors while its interior is used for parallelyzing the loops
The tensor results obtained from G are shown in Figure 18; these results offer an important understanding into the definition of the geometry of bodies in order to characterize the structures of economic interest, since the full tensor gravity (FTG) data were able to detect superficial bodies, bodies with measured and exact amplitudes in a range of −10 to 7 Eotvos

Summary

Introduction

The Xeon-Phi coprocessor is one of the new architectures designed for high performance computing (HPC). The Xeon-Phi is a X86 multicore architecture with low power consumption and with a theoretical performance of one Teraflop, for which it utilizes 60 real processing cores, a 30 MB cache, and high band-width interconnection [1]. One of the current research activities is to analyse the features and benefits of each new technology that emerges across the field of supercomputers, which is based, today, on GPUs and coprocessors. The effort to migrate scientific applications to CUDA (for GPUs) or OpenCL (i.e., programming that requires programming low level kernels) is often much higher as compared to directivebased programming like OpenMP (for CPU or MIC) [2]. Prior experiments which test the functionality of the XeonPhi coprocessor show that migrating scientific software is relatively easy, thereby making the MICs a promising tool for HPC applications [3]

Objectives

Methods

Findings

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Programming	Publication Date: Jan 1, 2015
Citations: 3	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

A Performance Study of a Dual Xeon-Phi Cluster for the Forward Modelling of Gravitational Fields

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Programming

Lead the way for us

Similar Papers

Area-efficient floorplans and interconnects for homogeneous multi-core architectures
Fadi N Sibai
International Journal of High Performance Systems Architecture | VOL. 1
Fadi N SibaiFadi N Sibai
01 Jan 2008
International Journal of High Performance Systems Architecture | VOL. 1

Efficient AdHoc Ondemand Distance Vector Routing Protocol using Link State Algorithm
Mustafa Jasim Al-Jubori ... S S Pawale
International Journal of Computer Applications | VOL. 26
Mustafa Jasim Al-Jubori, et. al.Mustafa Jasim Al-Jubori ... S S Pawale
31 Jul 2011
International Journal of Computer Applications | VOL. 26

Modern multicore and manycore architectures: Modelling, optimisation and benchmarking a multiblock CFD code
Ioan Hadade ... Luca Di Mare
Computer Physics Communications | VOL. 205
Ioan Hadade, et. al.Ioan Hadade ... Luca Di Mare
22 Apr 2016
Computer Physics Communications | VOL. 205

Parallelization of FDM/FEM computation for PDEs on PARAM YUVA-II cluster of Xeon Phi coprocessors
Sonia Rani ... Krishan Gopal Gupta
-
Sonia Rani, et. al.Sonia Rani ... Krishan Gopal Gupta
01 Dec 2014
01 Dec 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Performance Study of a Dual Xeon-Phi Cluster for the Forward Modelling of Gravitational Fields

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Programming