A Streaming Dataflow Engine for Sparse Matrix-Vector Multiplication Using High-Level Synthesis

Mohammad Hosseinabady,Jose Luis Nunez-Yanez

doi:10.1109/tcad.2019.2912923

Abstract

Using high-level synthesis techniques, this paper proposes an adaptable high-performance streaming dataflow engine for sparse matrix dense vector multiplication (SpMV) suitable for embedded FPGAs. As the SpMV is a memory-bound algorithm, this engine combines the three concepts of loop pipelining , dataflow graph , and data streaming to utilize most of the memory bandwidth available to the FPGA. The main goal of this paper is to show that FPGAs can provide comparable performance for memory-bound applications to that of the corresponding CPUs and GPUs but with significantly less energy consumption. The experimental results indicate that the FPGA provides higher performance compared to that of embedded GPUs for small and medium-size matrices by an average factor of 3.25 whereas the embedded GPU is faster for larger size matrices by an average factor of 1.58. In addition, the FPGA implementation is more energy efficient for the range of considered matrices by an average factor of 8.9 compared to the embedded CPU and GPU. A case study based on adapting the proposed SpMV optimization to accelerate the support vector machine (SVM) algorithm, one of the successful classification techniques in the machine learning literature, justifies the benefits of utilizing the proposed FPGA-based SpMV compared to that of the embedded CPU and GPU. The experimental results show that the FPGA is faster by an average factor of 1.7 and consumes less energy by an average factor of 6.8 compared to the GPU.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Streaming Dataflow Engine for Sparse Matrix-Vector Multiplication Using High-Level Synthesis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Lead the way for us

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems	Publication Date: Jun 6, 2019
Citations: 39

Similar Papers

Study on Gene Splicing Site Recognition Based on Particle Swarm Optimization Twin Support Vector Machine Algorithm for Smart Healthcare
Bo Wang ... Yiou Wang
Wireless Communications and Mobile Computing | VOL. 2023
Bo Wang, et. al.Bo Wang ... Yiou Wang
21 Apr 2023
Wireless Communications and Mobile Computing | VOL. 2023

Determination of Granting Appropriateness Credit at “Daruzzakah Rensing” Cooperative Using the Support Vector Machine (SVM) Algorithm
Nurhidayati ... Yahya
Journal of Physics: Conference Series | VOL. 1539
Nurhidayati, et. al. Nurhidayati ... Yahya
01 May 2020
Journal of Physics: Conference Series | VOL. 1539

Gradient Evolution-based Support Vector Machine Algorithm for Classification
Ferani E Zulvia ... R J Kuo
IOP Conference Series: Materials Science and Engineering | VOL. 319
Ferani E Zulvia, et. al.Ferani E Zulvia ... R J Kuo
01 Mar 2018
IOP Conference Series: Materials Science and Engineering | VOL. 319

Power Theft Detection Using Novel Linear SVM Algorithm and Compared With Convolutional SVM Algorithm For Accuracy
Murugesan ... M Vishnu Priya
-
Murugesan, et. al. Murugesan ... M Vishnu Priya
12 Nov 2022
12 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Streaming Dataflow Engine for Sparse Matrix-Vector Multiplication Using High-Level Synthesis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems