Abstract

Modern computing systems and applications place growing demands on memory bandwidth. This demand can be met with fast, large on-die or die-stacked memories, which are typically deployed alongside traditional DRAM in a heterogeneous memory system, either as a DRAM cache or as a hardware- or OS-managed part of memory (PoM). Caches adapt rapidly to application needs and typically provide higher performance, but they reduce the total OS-visible memory capacity. PoM architectures increase the total OS-visible memory capacity but incur additional overheads from swapping large blocks of data between fast and slow memory. In this paper, we propose Chameleon, a hybrid architecture that bridges the gap between cache and PoM architectures. When applications need a large amount of memory, Chameleon uses both fast and slow memories as PoM, maximizing the space available to the application. When an application's footprint is smaller than the total physical memory capacity, Chameleon opportunistically uses the free space as a hardware-managed cache. Chameleon is a hardware-software co-designed system in which the OS notifies the hardware of pages that are allocated or freed, and the hardware dynamically decides when to switch memory regions between PoM and cache modes. In our evaluation of multi-programmed workloads on a system with 4GB of fast memory and 20GB of slow memory, Chameleon improves average performance by 11.6% over PoM and 24.2% over a latency-optimized cache.
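To make the OS-hardware interaction concrete, the following is a minimal sketch of the mode-switching idea in C. It is an illustrative model only, not the paper's actual interface: the region granularity, the notify_page() call, and the all-pages-free policy in reconcile_modes() are assumptions made for this sketch; Chameleon's real policy and metadata are defined in the paper.

```c
/* Hypothetical sketch of Chameleon-style region mode switching.
 * All names, sizes, and the switching policy are illustrative
 * assumptions, not the paper's actual hardware interface. */
#include <stdbool.h>
#include <stdio.h>

#define NUM_REGIONS 64          /* assumed number of memory regions */

typedef enum { MODE_POM, MODE_CACHE } region_mode_t;

typedef struct {
    region_mode_t mode;         /* current use of this region        */
    int allocated_pages;        /* OS-visible pages currently in use */
} region_t;

static region_t regions[NUM_REGIONS];

/* OS -> hardware notification: a page in region r was allocated
 * or freed (the abstract's alloc/free notification, simplified). */
void notify_page(int r, bool allocated) {
    regions[r].allocated_pages += allocated ? 1 : -1;
}

/* Assumed hardware policy: a region with no live allocations can be
 * repurposed as a cache; a region holding allocated pages must stay
 * OS-visible PoM to preserve capacity. */
void reconcile_modes(void) {
    for (int r = 0; r < NUM_REGIONS; r++) {
        regions[r].mode = (regions[r].allocated_pages == 0)
                              ? MODE_CACHE   /* opportunistic caching */
                              : MODE_POM;    /* maximize capacity     */
    }
}

int main(void) {
    notify_page(0, true);       /* OS allocates a page in region 0 */
    reconcile_modes();
    printf("region 0 mode: %s\n",
           regions[0].mode == MODE_POM ? "PoM" : "cache");
    return 0;
}
```

The sketch captures the abstract's core trade-off: regions revert to PoM whenever the application needs the capacity, and only genuinely free regions are donated to the hardware-managed cache.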
