Mapping and Optimization Method of SpMV on Multi-DSP Accelerator

Yasong Cao,Shuwei Sun,Sheng Liu

doi:10.3390/electronics11223699

Yasong Cao, Shuwei Sun + Show 1 more

Open Access

https://doi.org/10.3390/electronics11223699

Copy DOI

Journal: Electronics	Publication Date: Nov 11, 2022
Citations: 2	License type: CC BY 4.0

Affiliation: National University of Defense Technology

Abstract

Sparse matrix-vector multiplication (SpMV) solves the product of a sparse matrix and dense vector, and the sparseness of a sparse matrix is often more than 90%. Usually, the sparse matrix is compressed to save storage resources, but this causes irregular access to dense vectors in the algorithm, which takes a lot of time and degrades the SpMV performance of the system. In this study, we design a dedicated channel in the DMA to implement an indirect memory access process to speed up the SpMV operation. On this basis, we propose six SpMV algorithm schemes and map them to optimize the performance of SpMV. The results show that the M processor’s SpMV performance reached 6.88 GFLOPS. Besides, the average performance of the HPCG benchmark is 2.8 GFLOPS.

Full Text