Learning-based Memory Allocation for C++ Server Workloads

Martin Maas, Colin Raffel, Kathryn S McKinley, David G Andersen, Mohammad Mahdi Javanmard, Michael Isard

doi:10.1145/3373376.3378525

Abstract

Modern C++ servers have memory footprints that vary widely over time, causing persistent heap fragmentation of up to 2x from long-lived objects allocated during peak memory usage. This fragmentation is exacerbated by the use of huge (2MB) pages, a requirement for high performance on large heap sizes. Reducing fragmentation automatically is challenging because C++ memory managers cannot move objects. This paper presents a new approach to huge page fragmentation. It combines modern machine learning techniques with a novel memory manager (LLAMA) that manages the heap based on object lifetimes and huge pages (divided into blocks and lines). A neural network-based language model predicts lifetime classes using symbolized calling contexts. The model learns context-sensitive per-allocation site lifetimes from previous runs, generalizes over different binary versions, and extrapolates from samples to unobserved calling contexts. Instead of size classes, LLAMA's heap is organized by lifetime classes that are dynamically adjusted based on observed behavior at a block granularity. LLAMA reduces memory fragmentation by up to 78% while only using huge pages on several production servers. We address ML-specific questions such as tolerating mispredictions and amortizing expensive predictions across application execution. Although our results focus on memory allocation, the questions we identify apply to other system-level problems with strict latency and resource requirements where machine learning could be applied.
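The core idea in the abstract, placing objects by predicted lifetime class rather than by size class, can be illustrated with a toy sketch. This is not LLAMA's implementation. The class boundaries, the `LookupPredictor` and `LifetimeHeap` names, and the context strings are hypothetical, and a plain lookup table stands in for the neural language model described in the paper. The sketch only shows three pieces: mapping observed lifetimes to classes, caching predictions per calling context so the cost of a prediction is amortized, and grouping allocations by predicted class.

```python
from bisect import bisect_left
from collections import defaultdict

# Hypothetical lifetime-class upper bounds in seconds (illustrative only,
# not taken from the abstract). Index len(CLASS_BOUNDS) means "longer than all".
CLASS_BOUNDS = [0.01, 0.1, 1.0, 10.0, 100.0]


def lifetime_class(seconds):
    """Map an observed lifetime to the index of the first bound it fits under."""
    return bisect_left(CLASS_BOUNDS, seconds)


class LookupPredictor:
    """Stand-in for the learned model (assumption): remembers the longest
    lifetime class observed for each symbolized calling context."""

    def __init__(self):
        self.seen = {}        # context string -> longest class observed
        self.cache = {}       # context string -> cached prediction
        self.model_calls = 0  # counts cache misses, i.e. "expensive" predictions

    def train(self, context, seconds):
        cls = lifetime_class(seconds)
        self.seen[context] = max(self.seen.get(context, 0), cls)
        self.cache.pop(context, None)  # drop any stale cached prediction

    def predict(self, context):
        if context not in self.cache:
            self.model_calls += 1
            # Unobserved contexts default to the longest class (an arbitrary
            # choice for this sketch; the real model generalizes instead).
            self.cache[context] = self.seen.get(context, len(CLASS_BOUNDS))
        return self.cache[context]


class LifetimeHeap:
    """Toy heap that groups object ids by predicted lifetime class."""

    def __init__(self, predictor):
        self.predictor = predictor
        self.regions = defaultdict(list)  # lifetime class -> object ids

    def allocate(self, obj_id, context):
        cls = self.predictor.predict(context)
        self.regions[cls].append(obj_id)
        return cls


predictor = LookupPredictor()
predictor.train("main>Parse>NewBuffer", 0.005)      # short-lived in a prior run
predictor.train("main>LoadConfig>NewTable", 500.0)  # long-lived in a prior run

heap = LifetimeHeap(predictor)
heap.allocate(1, "main>Parse>NewBuffer")
heap.allocate(2, "main>LoadConfig>NewTable")
heap.allocate(3, "main>Parse>NewBuffer")  # served from the prediction cache
```

In this sketch the two short-lived buffers end up in the same region, separate from the long-lived table, so freeing the buffers could empty their region entirely. The abstract describes further mechanisms that the sketch omits, such as adjusting lifetime classes dynamically at block granularity to tolerate mispredictions.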


