IPUG: Accelerating Breadth-First Graph Traversals Using Manycore Graphcore IPUs

Luk Burchard,Johannes Langguth,Daniel Thilo Schroeder,Johannes Moe,Konstantin Pogorelov

doi:10.1007/978-3-030-78713-4_16

Abstract

The Graphcore Intelligence Processing Unit (IPU) is a newly developed processor type whose architecture does not rely on the traditional caching hierarchies. Developed to meet the need for more and more data-centric applications, such as machine learning, IPUs combine a dedicated portion of SRAM with each of its numerous cores, resulting in high memory bandwidth at the price of capacity. The proximity of processor cores and memory makes the IPU a promising field of experimentation for graph algorithms since it is the unpredictable, irregular memory accesses that lead to performance losses in traditional processors with pre-caching.This paper aims to test the IPU’s suitability for algorithms with hard-to-predict memory accesses by implementing a breadth-first search (BFS) that complies with the Graph500 specifications. Precisely because of its apparent simplicity, BFS is an established benchmark that is not only subroutine for a variety of more complex graph algorithms, but also allows comparability across a wide range of architectures.We benchmark our IPU code on a wide range of instances and compare its performance to state-of-the-art CPU and GPU codes. The results indicate that the IPU delivers speedups of up to \(4{\times }\) over the fastest competing result on an NVIDIA V100 GPU, with typical speedups of about \(1.5{\times }\) on most test instances. KeywordsIPUGraph500BFSPerformance optimization

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

IPUG: Accelerating Breadth-First Graph Traversals Using Manycore Graphcore IPUs

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Fine-Grained Task Migration for Graph Algorithms Using Processing in Memory
Paula Aguilera ... Dong Ping Zhang
-
Paula Aguilera, et. al.Paula Aguilera ... Dong Ping Zhang
01 May 2016
01 May 2016

ScalaBFS: A Scalable BFS Accelerator on FPGA-HBM Platform
Chenhao Liu ... Jiajie Chen
-
Chenhao Liu, et. al.Chenhao Liu ... Jiajie Chen
17 Feb 2021
17 Feb 2021

ScalaBFS2: A High-performance BFS Accelerator on an HBM-enhanced FPGA Chip
Kexin Li ... Zhiyuan Shao
ACM Transactions on Reconfigurable Technology and Systems | VOL. 17
Kexin Li, et. al.Kexin Li ... Zhiyuan Shao
30 Apr 2024
ACM Transactions on Reconfigurable Technology and Systems | VOL. 17

Developing an Efficient Vector-Friendly Implementation of the Breadth-First Search Algorithm for NEC SX-Aurora TSUBASA
Ilya V Afanasyev ... Kazuhiko Komatsu
-
Ilya V Afanasyev, et. al.Ilya V Afanasyev ... Kazuhiko Komatsu
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

IPUG: Accelerating Breadth-First Graph Traversals Using Manycore Graphcore IPUs

Abstract

Talk to us

Similar Papers