Addressing Memory Wall Problem of Graph Computation in Reconfigurable System

Xu Wang,Linan Huang,Huwan Peng,Yongxin Zhu,Yipeng Zhou,Haifei Xiong

doi:10.1109/hpcc-css-icess.2015.77

Abstract

Graph computation problems that exhibit irregular memory access patterns are known to show poor performance on multiprocessor architectures. Although recent studies use FPGA technology to tackle the memory wall problem of graph computation by adopting a massively multi-threaded architecture, the performance is still far less than optimal memory performance due to the long memory access latency. In this paper, we address the memory wall problem by taking advantage of sequential streaming bandwidth of external DRAM memory. First, we present an edge-streaming model that streams edges from external DRAM memory while makes random access to the set of vertices in on-chip SRAM, leading to a fully utilization of external memory bandwidth in burst mode. Second, we propose an on-chip distributed off-chip shared memory architecture with a high performance shuffle network to real-timely shuffle intermediate results, which significantly reduces the requirement for intermediate buffers and saves off-chip memory bandwidth. We further use PageRank as a case study to validate the effectiveness of the proposed architecture. Evaluation results on ML605 board show that our architecture can achieve up to 4× improvement in terms of performance to bandwidth ratio over previously published FPGA-based implementations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Addressing Memory Wall Problem of Graph Computation in Reconfigurable System

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A comprehensive reconfigurable computing approach to memory wall problem of large graph computation
Xu Wang ... Yongxin Zhu
Journal of Systems Architecture | VOL. 70
Xu Wang, et. al.Xu Wang ... Yongxin Zhu
25 Apr 2016
Journal of Systems Architecture | VOL. 70

Accelerating Large-Scale Single-Source Shortest Path on FPGA
Shijie Zhou ... Viktor K Prasanna
-
Shijie Zhou, et. al.Shijie Zhou ... Viktor K Prasanna
01 May 2015
01 May 2015

Accelerating radiation dose calculation
Bo Zhou ... Xiaobo Sharon Hu
ACM Transactions on Embedded Computing Systems | VOL. 13
Bo Zhou, et. al.Bo Zhou ... Xiaobo Sharon Hu
01 Nov 2013
ACM Transactions on Embedded Computing Systems | VOL. 13

A Memory Access Scheduling Method for Multi-core Processor
Mengxiao Liu ... Weixing Ji
-
Mengxiao Liu, et. al.Mengxiao Liu ... Weixing Ji
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Addressing Memory Wall Problem of Graph Computation in Reconfigurable System

Abstract

Talk to us

Similar Papers