Transformer: An OS-Supported Reconfigurable Hybrid Memory Architecture

Ye Chi,Hai Jin,Xiaofei Liao,Ganwei Peng,Haikun Liu

doi:10.3390/app122412995

Abstract

Non-volatile memories (NVMs) have aroused vast interest in hybrid memory systems due to their promising features of byte-addressability, high storage density, low cost per byte, and near-zero standby energy consumption. However, since NVMs have limited write endurance, high write latency, and high write energy consumption, it is still challenging to directly replace traditional dynamic random access memory (DRAM) with NVMs. Many studies propose to utilize NVM and DRAM in a hybrid memory system, and explore sophisticated memory management schemes to alleviate the impact of slow NVM on the performance of applications. A few studies architected DRAM and NVM in a cache/memory hierarchy. However, the storage and performance overhead of the cache metadata (i.e., tags) management is rather expensive in this hierarchical architecture. Some other studies architected NVM and DRAM in a single (flat) address space to form a parallel architecture. However, the hot page monitoring and migration are critical for the performance of applications in this architecture. In this paper, we propose Transformer, an OS-supported reconfigurable hybrid memory architecture to efficiently use DRAM and NVM without redesigning the hardware architecture. To identify frequently accessed (hot) memory pages for migration, we propose to count the number of page accesses in OSes by sampling the access bit of pages periodically. We further migrate the identified hot pages from NVM to DRAM to improve the performance of hybrid memory system. More importantly, Transformer can simulate a hierarchical hybrid memory architecture while DRAM and NVM are physically managed in a flat address space, and can dynamically shift the logical memory architecture between parallel and hierarchical architectures according to applications’ memory access patterns. Experimental results show that Transformer can improve the application performance by 62% on average (up to 2.7×) compared with an NVM-only system, and can also improve performance by up to 79% and 42% (21% and 24% on average) compared with hierarchical and parallel architectures, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Transformer: An OS-Supported Reconfigurable Hybrid Memory Architecture

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Dec 18, 2022
License type: CC BY 4.0

Similar Papers

A unified hybrid memory system for scalable deep learning and big data applications
Wei Rang ... Dazhao Cheng
Journal of Parallel and Distributed Computing | VOL. 186
Wei Rang, et. al.Wei Rang ... Dazhao Cheng
28 Dec 2023
Journal of Parallel and Distributed Computing | VOL. 186

Design and Simulation of Multi-tiered Heterogeneous Memory Architecture
Jinyuan Hu ... Xiaofei Liao
-
Jinyuan Hu, et. al.Jinyuan Hu ... Xiaofei Liao
01 Oct 2022
01 Oct 2022

Utility-Based Hybrid Memory Management
Yang Li ... Jongmoo Choi
-
Yang Li, et. al.Yang Li ... Jongmoo Choi
01 Sep 2017
01 Sep 2017

Straggler-Aware Parallel Graph Processing in Hybrid Memory Systems
Wei Liu ... Hai Jin
-
Wei Liu, et. al.Wei Liu ... Hai Jin
01 May 2021
01 May 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transformer: An OS-Supported Reconfigurable Hybrid Memory Architecture

Abstract

Talk to us

Similar Papers

More From: Applied Sciences