Machine Learning Guided Optimal Use of GPU Unified Memory

Hailu Xu, Murali Emani, Pei-Hung Lin, Liting Hu, Chunhua Liao

doi:10.1109/mchpc49590.2019.00016

Abstract

NVIDIA's unified memory (UM) creates a pool of managed memory on top of physically separated CPU and GPU memories. UM automatically migrates page-level data on-demand so programmers can quickly write CUDA codes on heterogeneous machines without tedious and error-prone manual memory management. To improve performance, NVIDIA allows advanced programmers to pass additional memory use hints to its UM driver. However, it is extremely difficult for programmers to decide when and how to efficiently use unified memory, given the complex interactions between applications and hardware. In this paper, we present a machine learning-based approach to choosing between discrete memory and unified memory, with additional consideration of different memory hints. Our approach utilizes profiler-generated metrics of CUDA programs to train a model offline, which is later used to guide optimal use of UM for multiple applications at runtime. We evaluate our approach on NVIDIA Volta GPU with a set of benchmarks. Results show that the proposed model achieves 96% prediction accuracy in correctly identifying the optimal memory advice choice.
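The offline-train, runtime-predict workflow described in the abstract can be pictured with a small sketch. The abstract does not say which classifier or which profiler metrics the authors use, so everything here is an assumption made for illustration: the three metric names, the toy training rows, the label strings, and the choice of a 1-nearest-neighbour classifier. The label suffixes echo the real CUDA `cudaMemAdvise` hints `cudaMemAdviseSetReadMostly` and `cudaMemAdviseSetAccessedBy`, but the mapping from metrics to hints is invented.

```python
from math import dist

# Hypothetical profiler metrics per kernel run, each scaled to [0, 1]:
# (read_ratio, page_fault_rate, host_access_ratio).
# Each label is the memory choice assumed to have run fastest offline.
# All rows are invented for illustration; they are not data from the paper.
TRAINING_SET = [
    ((0.95, 0.10, 0.05), "unified+ReadMostly"),
    ((0.90, 0.20, 0.10), "unified+ReadMostly"),
    ((0.40, 0.80, 0.10), "discrete"),
    ((0.50, 0.90, 0.05), "discrete"),
    ((0.50, 0.30, 0.70), "unified+AccessedBy"),
    ((0.45, 0.25, 0.80), "unified+AccessedBy"),
    ((0.50, 0.30, 0.20), "unified"),
]


def train(rows):
    """Offline step: a 1-nearest-neighbour 'model' simply stores the rows."""
    return list(rows)


def predict(model, metrics):
    """Runtime step: return the label of the closest stored row."""
    _, label = min(model, key=lambda row: dist(row[0], metrics))
    return label


model = train(TRAINING_SET)
# A read-heavy kernel with few page faults and little host access.
print(predict(model, (0.92, 0.15, 0.08)))
```

In a real setting the stored rows would come from profiling benchmark kernels under each memory configuration, and the predicted label would decide whether the program allocates discrete device memory or managed memory with a given hint.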


