Lock-free Memory Research Articles

Streaming graph processing performs batched updates and analytics on a time-evolving graph. The underlying representation format of the graph largely determines the throughputs of these updates and analytics phases. Existing representation formats usually employ variations of hash tables or adjacency lists. However, a recent study showed that the adjacency-list-based approaches perform poorly on heavy-tailed graphs, and the hash table-based approaches suffer on short-tailed graphs. We propose GraphTango, a hybrid representation format that provides excellent update and analytics throughput regardless of the graph’s degree distribution. GraphTango dynamically switches among three different formats based on a vertex’s degree: (i) Low-degree vertices store the edges directly with the neighborhood metadata, confining accesses to a single cache line, (2) Medium-degree vertices use adjacency lists, and (3) High-degree vertices use hash tables as well as adjacency lists. In this case, the adjacency list provides fast traversal during the analytics phase, while the hash table provides constant-time lookups during the update phase. We further optimized the performance by designing an open-addressing-based hash table that fully utilizes every fetched cache line. In addition, we developed a thread-local lock-free memory pool that allows fast growing/shrinking of the adjacency lists and hash tables in a multi-threaded environment. We evaluated GraphTango with the help of the SAGA-Bench framework and compared it with four other representation formats: Stinger, Degree-aware Robin Hood Hashing, and two adjacency list-based formats with different workload balancing scheme. On average, GraphTango provides 4.5x higher insertion throughput, 3.2x higher deletion throughput, and 1.1x higher analytics throughput over the next best format. Furthermore, we integrated GraphTango with the state-of-the-art graph processing frameworks DZiG and RisGraph. Compared to the vanilla DZiG and vanilla RisGraph, [GraphTango + DZiG] and [GraphTango + RisGraph] reduces the average batch processing time by 2.3x and 1.5x, respectively.

Read full abstract

Dynamic memory allocators (malloc/free) rely on mutual exclusion locks for protecting the consistency of their shared data structures under multithreading. The use of locking has many disadvantages with respect to performance, availability, robustness, and programming flexibility. A lock-free memory allocator guarantees progress regardless of whether some threads are delayed or even killed and regardless of scheduling policies. This paper presents a completely lock-free memory allocator. It uses only widely-available operating system support and hardware atomic instructions. It offers guaranteed availability even under arbitrary thread termination and crash-failure, and it is immune to deadlock regardless of scheduling policies, and hence it can be used even in interrupt handlers and real-time applications without requiring special scheduler support. Also, by leveraging some high-level structures from Hoard, our allocator is highly scalable, limits space blowup to a constant factor, and is capable of avoiding false sharing. In addition, our allocator allows finer concurrency and much lower latency than Hoard. We use PowerPC shared memory multiprocessor systems to compare the performance of our allocator with the default AIX 5.1 libc malloc, and two widely-used multithread allocators, Hoard and Ptmalloc. Our allocator outperforms the other allocators in virtually all cases and often by substantial margins, under various levels of parallelism and allocation patterns. Furthermore, our allocator also offers the lowest contention-free latency among the allocators by significant margins.

Read full abstract

Lock-free Memory Research Articles

Related Topics

Articles published on Lock-free Memory

GraphTango: A Hybrid Representation Format for Efficient Streaming Graph Updates and Analysis

Wfspan: Wait-free Dynamic Memory Management

Decoupling lock-free data structures from memory reclamation for static analysis

Every data structure deserves lock-free memory reclamation

PLDI 2004

Automatic memory reclamation for lock-free data structures

Dynamic synthesis for relaxed memory models

Efficient and Reliable Lock-Free Memory Reclamation Based on Reference Counting

NBmalloc: Allocating Memory in a Lock-Free Manner

NOBLE

Scalable lock-free dynamic memory allocation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Lock-free Memory Research Articles

Related Topics

Articles published on Lock-free Memory

GraphTango: A Hybrid Representation Format for Efficient Streaming Graph Updates and Analysis

Wfspan: Wait-free Dynamic Memory Management

Decoupling lock-free data structures from memory reclamation for static analysis

Every data structure deserves lock-free memory reclamation

PLDI 2004

Automatic memory reclamation for lock-free data structures

Dynamic synthesis for relaxed memory models

Efficient and Reliable Lock-Free Memory Reclamation Based on Reference Counting

NBmalloc: Allocating Memory in a Lock-Free Manner

NOBLE

Scalable lock-free dynamic memory allocation