DD-αAMG on QPACE 3

Peter Georg, Tilo Wettig, Daniel Richtmann, M Della Morte, C Pena Ruano, P Fritzsch, E Gámiz Sánchez

doi:10.1051/epjconf/201817502007

Abstract

We describe our experience porting the Regensburg implementation of the DD-αAMG solver from QPACE 2 to QPACE 3. We first review how the code was ported from the first-generation Intel Xeon Phi processor (Knights Corner) to its successor (Knights Landing). We then describe the modifications to the communication library necessitated by the switch from InfiniBand to Omni-Path. Finally, we present the performance of the code on a single processor as well as its scaling on many nodes; in both cases the speedup factor is close to the theoretical expectations.

Highlights

The lattice QCD (LQCD) community has traditionally been an early adopter of new computing and network architectures.
The subject of this contribution was the port of our existing code base for QPACE 2 to our new machine QPACE 3.
On Knights Corner (KNC) we could achieve a significant performance gain using half precision, but on Knights Landing (KNL) half precision deteriorates performance rather than improving it, at least with our current implementation.

Summary

Introduction

The lattice QCD (LQCD) community has traditionally been an early adopter of new computing and network architectures. Adopting them typically requires major effort to port simulation code and sometimes even communication libraries. The present contribution focuses on the software work we did to run the Regensburg implementation of the DD-αAMG solver efficiently on QPACE 3.

Overview

DD-αAMG for Xeon Phi

On-chip strong scaling

Multi-node benchmarks

Findings

Conclusions and future opportunities


Journal: EPJ Web of Conferences
Publication Date: Jan 1, 2018
Citations: 14
License type: CC BY 4.0


