Approximate Memory Compression

Ashish Ranjan,Anand Raghunathan,Vijay Raghunathan,Arnab Raha

doi:10.1109/tvlsi.2020.2970041

Abstract

Memory subsystems are a major energy bottleneck in computing platforms due to frequent transfers between processors and off-chip memory. We propose approximate memory compression, a technique that leverages the intrinsic resilience of emerging workloads such as machine learning and data analytics to reduce off-chip memory traffic, thereby improving energy and performance. We realize approximate memory compression by enhancing the memory controller to be aware of approximate memory regions—regions in memory that contain approximation-resilient data—and to transparently compress (decompress) the data written to (read from) these regions. To provide control over approximations, each approximate memory region is associated with an error constraint such as the maximum error that may be introduced in each data element. The quality-aware memory controller subjects memory transactions to a compression scheme that introduces approximations, thereby reducing memory traffic, while adhering to the specified error constraint for each approximate memory region. A software interface is provided to allow programmers to identify data structures (DSs) that are resilient to approximations. A runtime quality control framework automatically determines the error constraints for the identified DSs such that a given target application-level quality is maintained. We evaluate our proposal by applying it to three different main memory technologies in the context of a general-purpose computing system—DDR3 DRAM, LPDDR3 DRAM, and spin-transfer torque magnetic RAM (STT-MRAM). To demonstrate the feasibility of the proposed concepts, we also implement a hardware prototype using the Intel UniPHY-DDR3 memory controller and Nios-II processor, a Hynix DDR3 DRAM module, and a Stratix-IV field-programmable gate array (FPGA) development board. Across a wide range of machine learning benchmarks, approximate memory compression obtains significant benefits in main memory energy ( $1.18\times $ for DDR3 DRAM, $1.52\times $ for LPDDR3 DRAM, and $2.0\times $ for STT-MRAM) and a simultaneous improvement in execution time (5.2% for DDR3 DRAM, 5.4% for LPDDR3 DRAM, and 9.3% for STT-MRAM) with nearly identical application output quality.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Very Large Scale Integration (VLSI) Systems	Publication Date: Apr 1, 2020
Citations: 41	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Approximate Memory Compression

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Lead the way for us

Similar Papers

Approximate memory compression for energy-efficiency
Ashish Ranjan ... Vijay Raghunathan
-
Ashish Ranjan, et. al.Ashish Ranjan ... Vijay Raghunathan
01 Jul 2017
01 Jul 2017

One-step majority-logic-decodable codes enable STT-MRAM for high speed working memories
Weisheng Zhao ... Wang Kang
-
Weisheng Zhao, et. al.Weisheng Zhao ... Wang Kang
01 Aug 2014
01 Aug 2014

Performance and Power Estimation of STT-MRAM Main Memory with Reliable System-level Simulation
Petar Radojković ... Rommel Sánchez Verdejo
ACM Transactions on Embedded Computing Systems | VOL. 21
Petar Radojković, et. al.Petar Radojković ... Rommel Sánchez Verdejo
14 Jan 2022
ACM Transactions on Embedded Computing Systems | VOL. 21

Write Pulse Scaling for Energy Efficient STT-MRAM
Rami Melhem ... Alex K Jones
-
Rami Melhem, et. al.Rami Melhem ... Alex K Jones
01 Jul 2016
01 Jul 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Approximate Memory Compression

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems