Abstract

Datacenter operators rely on low-cost, high-density technologies to maximize throughput for data-intensive services with tight tail-latency requirements. In-memory rack-scale computing is emerging as a promising paradigm in scale-out datacenters, capitalizing on commodity SoCs, low-latency and high-bandwidth communication fabrics, and a remote memory access model to enable the aggregation of a rack's memory for critical data-intensive applications such as graph processing or key-value stores. Achieving low latency and high bandwidth dictates not only eliminating communication bottlenecks in the software protocols and off-chip fabrics, but also carefully integrating network interfaces on chip. The latter is a key challenge, especially in architectures that aim to achieve low latency and high bandwidth through on-chip Network Interface (NI) support for RDMA-inspired one-sided operations. This paper proposes and evaluates NI architectures for tiled manycore SoCs targeting in-memory rack-scale computing. Our results indicate that carefully splitting NI functionality between each chip tile and the chip's edge along a NOC dimension enables a rack-scale architecture to optimize for both latency and bandwidth. Our best manycore NI architecture achieves latencies within 3% of an idealized hardware NUMA and efficiently uses the full bisection bandwidth of the NOC, without changing the on-chip coherence protocol or the core's microarchitecture.
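To make the one-sided remote access model concrete, the sketch below shows how a core might issue a remote read through a memory-mapped work queue (WQ) and observe its completion via a completion queue (CQ), with no software running on the remote node. This is an illustrative assumption, not the paper's actual interface: all names (post_remote_read, wq_entry, ni_service_one) are hypothetical, and the NI pipeline is stubbed out in software so the example is self-contained and runnable.

    /* Minimal sketch of a one-sided remote read in the spirit of
     * RDMA-inspired NI designs. Hypothetical API; in hardware, the NI
     * (not this process) would service the work queue over the fabric. */
    #include <stdint.h>
    #include <stdio.h>
    #include <string.h>

    #define QUEUE_DEPTH 64
    #define BLOCK_BYTES 64          /* one cache-block-sized transfer */

    typedef struct {                /* WQ entry: a one-sided read request */
        uint16_t dst_node;          /* rack node holding the data */
        uint64_t remote_addr;       /* address in the global address space */
        void    *local_buf;         /* where the NI deposits the reply */
    } wq_entry;

    typedef struct { uint32_t req_id; } cq_entry;  /* completion record */

    static wq_entry wq[QUEUE_DEPTH];
    static cq_entry cq[QUEUE_DEPTH];
    static uint32_t wq_tail, cq_head, cq_tail;

    /* Post a one-sided read: the core writes one WQ entry and returns.
     * No software runs on the remote node; its NI answers the request. */
    static uint32_t post_remote_read(uint16_t node, uint64_t raddr, void *buf)
    {
        uint32_t id = wq_tail;
        wq[wq_tail % QUEUE_DEPTH] = (wq_entry){ node, raddr, buf };
        wq_tail++;
        return id;
    }

    /* Stand-in for the NI pipeline: hardware would issue a NOC/fabric
     * transaction; here we fabricate the remote block and complete. */
    static void ni_service_one(void)
    {
        wq_entry *e = &wq[cq_tail % QUEUE_DEPTH];
        memset(e->local_buf, 0xAB, BLOCK_BYTES);   /* pretend remote data */
        cq[cq_tail % QUEUE_DEPTH].req_id = cq_tail;
        cq_tail++;
    }

    /* Spin on the CQ until the given request completes. */
    static void wait_for(uint32_t req_id)
    {
        while (cq_head <= req_id) {
            ni_service_one();                      /* done by hardware NI */
            cq_head = cq_tail;
        }
    }

    int main(void)
    {
        uint8_t buf[BLOCK_BYTES];
        uint32_t id = post_remote_read(/*node=*/3, /*raddr=*/0x1000, buf);
        wait_for(id);
        printf("remote block byte 0 = 0x%02x\n", buf[0]);
        return 0;
    }

The design point the sketch highlights is that the core only enqueues a descriptor and polls for completion; where the NI logic that services the queue physically sits (per tile versus at the chip's edge) is exactly the latency/bandwidth trade-off the paper evaluates.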
