Implementation and Optimization of a CFD Solver Using Overlapped Meshes on Multiple MIC Coprocessors

Wenpeng Ma,Wu Yuan,Xiaodong Hu

doi:10.1155/2019/4254676

Abstract

In this paper, we develop and parallelize a CFD solver that supports overlapped meshes on multiple MIC architectures by using multithreaded technique. We optimize the solver through several considerations including vectorization, memory arrangement, and an asynchronous strategy for data exchange on multiple devices. Comparisons of different vectorization strategies are made, and the performances of core functions of the solver are reported. Experiments show that about 3.16x speedup can be achieved for the six core functions on a single Intel Xeon Phi 5110P MIC card, and 5.9x speedup can be achieved using two cards compared to an Intel E5-2680 processor for two ONERA M6 wings case.

Highlights

Computing with accelerators such as graphics processing unit (GPU) [1] and Intel many integrated core (MIC) architecture [2] has been attractive in computational fluid dynamics (CFD) areas recent years because it provides researchers with the possibility of accelerating or scaling their numerical codes by various parallel techniques
Our experiments were conducted on the YUAN cluster at the Computer Network Information Center at the Chinese Academy of Sciences. e cluster is of hybrid architecture that consists of both MIC and GPU nodes. e configuration of MIC nodes is that each node has two Intel E5-2680 V2 (Ivy Bridge, 2.8 GHz, 10 cores) CPUs and two Intel Xeon Phi 5110P MIC coprocessors. e memory capacity for the host and coprocessors is 64 GB and 8 GB, respectively
We used two ONERA M6 wings, each of which was configured with four 129 × 113 × 105 subblocks. e lower wing and its mesh system were formed by making a translation of the upper wing down along Y-axis by the length of the wing, and the two mesh systems overlapped with each other

Summary

Introduction

Computing with accelerators such as graphics processing unit (GPU) [1] and Intel many integrated core (MIC) architecture [2] has been attractive in computational fluid dynamics (CFD) areas recent years because it provides researchers with the possibility of accelerating or scaling their numerical codes by various parallel techniques. Intel MIC architecture consists of processors that inherit many key features of Intel CPU cores, which makes the code migrating less expensive and become popular in the development of parallel algorithms. Many researchers [13,14,15,16,17] have studied GPU computing on structured meshes, which involved coalesced computation technique [13], heterogeneous algorithm [15, 17], numerical methods [16], etc. Corrigan et al [18] investigated an Euler solver on GPU by employing unstructured grid and gained important factor of speedup over CPUs. en, a lot of results included data structure

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Programming	Publication Date: May 27, 2019
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Implementation and Optimization of a CFD Solver Using Overlapped Meshes on Multiple MIC Coprocessors

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Programming

Lead the way for us

Similar Papers

AAlign: A SIMD Framework for Pairwise Sequence Alignment on x86-Based Multi-and Many-Core Processors
Kaixi Hou ... Hao Wang
-
Kaixi Hou, et. al.Kaixi Hou ... Hao Wang
01 May 2016
01 May 2016

Gender Differences in Motivation and Teacher Performance in Core Functions in Kenyan Secondary Schools
Catherine K. Wanakacha ... Philip Nyaswa
Academic Journal of Interdisciplinary Studies | VOL. 7
Catherine K. Wanakacha, et. al.Catherine K. Wanakacha ... Philip Nyaswa
01 Mar 2018
Academic Journal of Interdisciplinary Studies | VOL. 7

Implementation of High-Order Multireference Coupled-Cluster Methods on Intel Many Integrated Core Architecture
E Aprà ... K Kowalski
Journal of Chemical Theory and Computation | VOL. 12
E Aprà, et. al.E Aprà ... K Kowalski
05 Feb 2016
Journal of Chemical Theory and Computation | VOL. 12

Unified Heterogeneous Networking Design
Amandeep Singh ... Peter Thermos
-
Amandeep Singh, et. al.Amandeep Singh ... Peter Thermos
15 Oct 2013
15 Oct 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Implementation and Optimization of a CFD Solver Using Overlapped Meshes on Multiple MIC Coprocessors

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Programming