A heterogeneous 3-D stacked PIM accelerator for GCN-based recommender systems

Xinyang Shen, Long Zheng, Yu Huang, Hai Jin, Xiaofei Liao

doi:10.1007/s42514-024-00180-4

Abstract

Modern recommendation systems integrate graph convolutional neural networks (GCNs) to enhance embedding representations. Compared with widely deployed neural network-based models, GCN-based recommendation adds a message propagation layer that involves extensive computation and irregular memory access. Architecture designs for prevailing deep neural network recommendation models, however, assume simple pooling in the embedding layer. ReRAM-based GCN accelerators are specialized for graph-related operations, but they are designed for general graphs, whereas GCN-based recommendation models mainly operate on the user-item graph. In this paper, we propose ReGCNR, a resistive random-access memory (ReRAM) based processing-in-memory (PIM) accelerator for GCN-based recommendation. ReGCNR features three key innovations. First, we exploit 3-dimensional (3-D) stacked heterogeneous ReRAM to accommodate the large embedding table and user-item graph. Second, we propose a joint degree mapping schema that maximizes the efficiency of the execution pipeline. Third, ReGCNR combines a well-coordinated pipeline with a hardware scheduling design to boost overall system performance. Results show that ReGCNR outperforms a GPU by 69.83× in speedup and 56.67× in energy saving on average. In addition, ReGCNR outperforms state-of-the-art ReRAM-based solutions by 11.13× in speedup and 7.22× in energy saving on average.
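To make the message propagation layer concrete, here is a minimal sketch of one propagation step over a user-item graph. It assumes a LightGCN-style symmetric degree normalization, which the abstract does not specify; the function name `propagate` and the toy data are hypothetical and are not taken from the paper. The inner loop shows why the access pattern is irregular: each interaction gathers a row from the embedding table at a data-dependent index.

```python
from math import sqrt


def propagate(interactions, user_emb, item_emb):
    """One message-propagation step on a bipartite user-item graph.

    Assumption: symmetric normalization 1 / sqrt(deg(u) * deg(i)),
    as in LightGCN-style models; the paper's exact layer may differ.
    """
    dim = len(user_emb[0])

    # Node degrees are derived from the interaction (edge) list.
    user_deg = [0] * len(user_emb)
    item_deg = [0] * len(item_emb)
    for u, i in interactions:
        user_deg[u] += 1
        item_deg[i] += 1

    new_user = [[0.0] * dim for _ in user_emb]
    new_item = [[0.0] * dim for _ in item_emb]
    for u, i in interactions:
        w = 1.0 / sqrt(user_deg[u] * item_deg[i])
        for k in range(dim):
            # Data-dependent gathers: the rows read depend on the edge (u, i).
            new_user[u][k] += w * item_emb[i][k]
            new_item[i][k] += w * user_emb[u][k]
    return new_user, new_item


# Toy graph: user 0 interacted with items 0 and 1, user 1 with item 1.
interactions = [(0, 0), (0, 1), (1, 1)]
user_emb = [[1.0, 0.0], [0.0, 1.0]]
item_emb = [[1.0, 1.0], [2.0, 0.0]]
new_user, new_item = propagate(interactions, user_emb, item_emb)
```

In this sketch the work per node scales with its degree, so high-degree users and items dominate both the computation and the memory traffic.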

Journal: CCF Transactions on High Performance Computing
Publication Date: Feb 28, 2024
Citations: 1
License type: CC BY 4.0
