ComputeDRAM

Fei Gao,Georgios Tziantzioulis,David Wentzlaff

doi:10.1145/3352460.3358260

Abstract

In-memory computing has long been promised as a solution to the Memory Wall problem. Recent work has proposed using chargesharing on the bit-lines of a memory in order to compute in-place and with massive parallelism, all without having to move data across the memory bus. Unfortunately, prior work has required modification to RAM designs (e.g. adding multiple row decoders) in order to open multiple rows simultaneously. So far, the competitive and low-margin nature of the DRAM industry has made commercial DRAM manufacturers resist adding any additional logic into DRAM. This paper addresses the need for in-memory computation with little to no change to DRAM designs. It is the first work to demonstrate in-memory computation with off-the-shelf, unmodified, commercial, DRAM. This is accomplished by violating the nominal timing specification and activating multiple rows in rapid succession, which happens to leave multiple rows open simultaneously, thereby enabling bit-line charge sharing. We use a constraint-violating command sequence to implement and demonstrate row copy, logical OR, and logical AND in unmodified, commodity, DRAM. Subsequently, we employ these primitives to develop an architecture for arbitrary, massively-parallel, computation. Utilizing a customized DRAM controller in an FPGA and commodity DRAM modules, we characterize this opportunity in hardware for all major DRAM vendors. This work stands as a proof of concept that in-memory computation is possible with unmodified DRAM modules and that there exists a financially feasible way for DRAM manufacturers to support in-memory compute.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ComputeDRAM

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

IN-MEMORY COMPUTING WITH CMOS AND EMERGING MEMORY TECHNOLOGIES

-

17 Oct 2019
17 Oct 2019

FAST: A Fully-Concurrent Access SRAM Topology for High Row-Wise Parallelism Applications Based on Dynamic Shift Operations
Yiming Chen ... Yongpan Liu
IEEE Transactions on Circuits and Systems II: Express Briefs | VOL. 70
Yiming Chen, et. al.Yiming Chen ... Yongpan Liu
01 Apr 2023
IEEE Transactions on Circuits and Systems II: Express Briefs | VOL. 70

Computing in Memory With Spin-Transfer Torque Magnetic RAM
Shubham Jain ... Anand Raghunathan
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 26
Shubham Jain, et. al.Shubham Jain ... Anand Raghunathan
01 Mar 2018
IEEE Transactions on Very Large Scale Integration (VLSI) Systems | VOL. 26

Survey on memory management techniques in heterogeneous computing systems
Anakhi Hazarika ... Hafizur Rahaman
IET Computers & Digital Techniques | VOL. 14
Anakhi Hazarika, et. al.Anakhi Hazarika ... Hafizur Rahaman
21 Jan 2020
IET Computers & Digital Techniques | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ComputeDRAM

Abstract

Talk to us

Similar Papers