Near-Memory Processing in Action: Accelerating Personalized Recommendation With AxDIMM

Liu Ke, Yongsuk Kwon, Joonho Song, Jeonghyeon Cho, Sukhan Lee, Xuan Zhang, Jinin So, Jin Hyun Kim, Yeongon Cho, Ilkwon Yun, Nam Sung Kim, Sung Joo Park, Jin Chul Jung, Hyun Sun Park, Shin Haeng Kang, Kyung-Soo Kim, Song-Yi Han, Jong Geon Lee, Kiwon Sohn, Hsien Hsin Sean Lee

doi:10.1109/mm.2021.3097700

Abstract

Near-memory processing (NMP) is a promising paradigm that enables memory-centric computing. By moving compute capability next to the main memory (DRAM modules), it can fundamentally address the CPU-memory bandwidth bottleneck and thus improve the performance of memory-constrained workloads. Using the personalized recommendation system as a driving example, we developed a scalable, practical DIMM-based NMP solution tailored to accelerating inference serving. Our solution is demonstrated on a versatile FPGA-enabled NMP platform called AxDIMM, which allows rapid prototyping and evaluation of NMP’s performance potential on real hardware, under a realistic system setting, using an industry-representative recommendation framework. We experimentally validated the performance of a two-ranked AxDIMM prototype, which achieves up to 1.89× speedup in latency and 31.6% memory energy saving for embedding operations. For end-to-end recommendation inference serving, AxDIMM improves throughput by up to 1.5× and latency-bounded throughput by up to 1.77×.

Journal: IEEE Micro
Publication Date: Jan 1, 2022
Citations: 43

Similar Papers

SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations
Hongju Kal ... Gun Ko
01 Jun 2021

ABC-DIMM: Alleviating the Bottleneck of Communication in DIMM-based Near-Memory Processing with Inter-DIMM Broadcast
Weiyi Sun ... Shaojun Wei
01 Jun 2021

Power-Time Exploration Tools for NMP-Enabled Systems
Chae Eun Rhee ... Hyuk-Jae Lee
Electronics | VOL. 8
28 Sep 2019

Accelerating Personalized Recommendation with Cross-level Near-Memory Processing
Haifeng Liu ... Jingling Xue
17 Jun 2023