Managing the topology of heterogeneous cluster nodes with hardware locality (hwloc)

Brice Goglin

doi:10.1109/hpcsim.2014.6903671

Abstract

Modern computing platforms are increasingly complex, with multiple cores, shared caches, and NUMA architectures. Parallel applications developers have to take locality into account before they can expect good efficiency on these platforms. Thus there is a strong need for a portable tool gathering and exposing this information. The Hardware Locality project (hwloc) offers a tree representation of the hardware based on the inclusion and localities of the CPU and memory resources. It is already widely used for affinity-based task placement in high performance computing. In this article we present how hwloc is extended to describe more than computing and memory resources. Indeed, I/O device locality is becoming another important aspect of locality since high performance GPUs, network or InfiniBand interfaces possess privileged access to some of the cores and memory banks. hwloc integrates this knowledge into its topology representation and offers an interoperability API to extend existing libraries such as CUDA with locality information. We also describe how hwloc now helps process managers and batch schedulers to deal with the topology of multiple cluster nodes, together with compression for better scalability up to thousands of nodes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Managing the topology of heterogeneous cluster nodes with hardware locality (hwloc)

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

MasFS: File System Based on Memory and SSD in Compute Nodes for High Performance Computers
Xin Liu ... Yutong Lu
-
Xin Liu, et. al.Xin Liu ... Yutong Lu
01 Dec 2016
01 Dec 2016

Memory resource considerations in the load balancing of software DSM systems
Yen-Tso Liu ... Chi-Ting Huang
-
Yen-Tso Liu, et. al. Yen-Tso Liu ... Chi-Ting Huang
27 Oct 2003
27 Oct 2003

Optimizing main-memory join on modern hardware
S Manegold ... M Kersten
IEEE Transactions on Knowledge and Data Engineering | VOL. 14
S Manegold, et. al.S Manegold ... M Kersten
01 Jan 1998
IEEE Transactions on Knowledge and Data Engineering | VOL. 14

Cluster Managers
Mohammed Guller
-
Mohammed GullerMohammed Guller
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Managing the topology of heterogeneous cluster nodes with hardware locality (hwloc)

Abstract

Talk to us

Similar Papers