Abstract

Modern DNN frameworks exploit GPU acceleration by default to achieve high performance. As DNNs grow deeper and larger, the limited capacity of GPU memory becomes a serious problem. This paper proposes a purely software-based, transparent solution to the GPU memory capacity problem, called tvDNN. It is based on GPU memory swapping and memory-object sectioning techniques, and it provides efficient memory-object swapping schedules computed either optimally with ILP or suboptimally with heuristics. The experimental results show that tvDNN enables Caffe to build VGG-16 with a large batch size, such as 256 or 512, using only a few GB of GPU memory and without significant performance degradation.
