Manycore Network Interfaces for In-Memory Rack-Scale Computing

Daglis, Alexandros; Novakovic, Stanko; Bugnion, Edouard; Falsafi, Babak; Grot, Boris

doi:10.1145/2749469.2750415

conference paper

Manycore Network Interfaces for In-Memory Rack-Scale Computing

•

•

2015

Proceedings of the 42nd International Symposium in Computer Architecture

42nd International Symposium in Computer Architecture

Datacenter operators rely on low-cost, high-density technologies to maximize throughput for data-intensive services with tight tail latencies. In-memory rack-scale computing is emerging as a promising paradigm in scale-out datacenters capitalizing on commodity SoCs, low-latency and high-bandwidth communication fabrics and a remote memory access model to enable aggregation of a rack’s memory for critical data-intensive applications such as graph processing or key-value stores. Low latency and high bandwidth not only dictate eliminating communication bottlenecks in the software protocols and off-chip fabrics but also a careful on-chip integration of network interfaces. The latter is a key challenge especially in architectures with RDMA-inspired one-sided operations that aim to achieve low latency and high bandwidth through on-chip Network Interface (NI) support. This paper proposes and evaluates network interface architectures for tiled manycore SoCs for in-memory rack-scale computing. Our results indicate that a careful splitting of NI functionality per chip tile and at the chip’s edge along a NOC dimension enables a rack-scale architecture to optimize for both latency and bandwidth. Our best manycore NI architecture achieves latencies within 3% of an idealized hardware NUMA and efficiently uses the full bisection bandwidth of the NOC, without changing the on-chip coherence protocol or the core’s microarchitecture.

Name

manycoreNI_isca15.pdf

Access type

openaccess

Size

1022.53 KB

Format

Adobe PDF

Checksum (MD5)

aaaed4a6f52a32f8e9872b96251e9f51