LAPT: A locality-aware page table for thread and data mapping

Eduardo H.M. Cruz, Laércio L. Pilla, Matthias Diener, Marco A.Z. Alves, Philippe O.A. Navaux

doi:10.1016/j.parco.2015.12.001

Abstract

The performance and energy efficiency of current systems are influenced by accesses to the memory hierarchy. One important aspect of memory hierarchies is that memory access times differ depending on the core that requested the transaction and on which cache or main memory bank responded to it. In this context, the locality of memory accesses plays a key role in the performance and energy efficiency of parallel applications, because accesses to remote caches and NUMA nodes are more expensive than accesses to local ones. With information about the memory access pattern, pages can be migrated to the NUMA nodes that access them (data mapping), and threads that communicate can be migrated to the same node (thread mapping). In this paper, we present LAPT, a hardware-based mechanism that stores the memory access pattern of parallel applications in the page table. The operating system uses the detected memory access pattern to perform an optimized thread and data mapping during the execution of the parallel application. Experiments with a wide range of parallel applications (from the NAS and PARSEC benchmark suites) on a NUMA machine showed significant performance and energy efficiency improvements of up to 19.2% and 15.7%, respectively (6.7% and 5.3% on average).
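
The two mapping decisions named in the abstract can be illustrated with a minimal sketch. This is not LAPT's hardware mechanism or the authors' operating system policy. It only assumes that some per-page access information is already available, and every name below (data_mapping, communication_matrix, thread_mapping, and the input dictionaries) is hypothetical. Data mapping is approximated by placing each page on the node that accessed it most, and thread mapping by greedily pairing the threads that share the most pages.

```python
from collections import defaultdict
from itertools import combinations


def data_mapping(page_access_counts):
    """Pick a NUMA node for each page (illustrative policy, not LAPT's).

    page_access_counts: {page: {node: access_count}} (hypothetical input).
    Returns {page: node}, choosing the node with the most accesses;
    ties go to the lowest node id.
    """
    return {
        page: max(sorted(counts), key=counts.get)
        for page, counts in page_access_counts.items()
        if counts
    }


def communication_matrix(page_sharers):
    """Count how many pages each pair of threads has in common.

    page_sharers: {page: set_of_thread_ids} (hypothetical input).
    Returns {(a, b): shared_page_count} with a < b.
    """
    comm = defaultdict(int)
    for threads in page_sharers.values():
        for a, b in combinations(sorted(threads), 2):
            comm[(a, b)] += 1
    return dict(comm)


def thread_mapping(comm):
    """Greedily pair threads, heaviest-communicating pairs first.

    Each returned pair is a candidate for placement on the same node.
    This greedy matching is an assumption made for illustration only.
    """
    placed = set()
    pairs = []
    for (a, b), _ in sorted(comm.items(), key=lambda kv: (-kv[1], kv[0])):
        if a not in placed and b not in placed:
            pairs.append((a, b))
            placed.update((a, b))
    return pairs


if __name__ == "__main__":
    counts = {0x1000: {0: 12, 1: 3}, 0x2000: {0: 1, 1: 9}}
    sharers = {0x1000: {0, 1}, 0x2000: {0, 1}, 0x3000: {2, 3}, 0x4000: {1, 2}}
    print(data_mapping(counts))
    print(thread_mapping(communication_matrix(sharers)))
```

In a real system the resulting decisions would still have to be applied through the operating system's page migration and thread affinity facilities, which this sketch leaves out.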

Journal: Parallel Computing
Publication Date: Dec 11, 2015
Citations: 10

Similar Papers

Optimizing Memory Locality Using a Locality-Aware Page Table
Eduardo H.M Cruz ... Laercio L Pilla
01 Oct 2014

Online Thread and Data Mapping Using a Sharing-Aware Memory Management Unit
Eduardo H M Cruz ... Laércio L Pilla
ACM Transactions on Modeling and Performance Evaluation of Computing Systems | VOL. 5
31 Dec 2020

A template library to integrate thread scheduling and locality management for NUMA multiprocessors
...
07 Jun 2012

Sharing-Aware Mapping and Parallel Architectures
Eduardo H M Cruz ... Matthias Diener
01 Jan 2018
