An Analysis of Load Imbalance in Scale-out Data Serving

Novakovic, Stanko; Daglis, Alexandros; Bugnion, Edouard; Falsafi, Babak; Grot, Boris

doi:10.1145/2896377.2901501

conference paper

2016

Proceedings of the 2016 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science

ACM SIGMETRICS

Despite the natural parallelism across lookups, the performance of distributed key-value stores is often limited by load imbalance induced by heavy skew in the popularity distribution of the dataset. To avoid violating service level objectives expressed in terms of tail latency, systems tend to keep server utilization low and organize the data in micro-shards, which in turn provide units of migration and replication for the purpose of load balancing. These techniques reduce the skew, but incur additional monitoring, data replication and consistency maintenance overheads. This work shows that the trend towards extreme scale-out will further exacerbate the skew-induced load imbalance, and hence the overhead of migration and replication.
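The scale-out effect described in the abstract can be illustrated with a small sketch. It is not the paper's model or methodology: the Zipfian popularity with exponent 0.99, the 100,000-key dataset, the uniform random placement of keys on servers, and the function names are all assumptions chosen for illustration. The sketch reports the imbalance factor, defined here as the load on the busiest server divided by the mean load per server.

```python
import random


def zipf_popularity(num_keys, alpha):
    """Return the normalized request share of each key, hottest key first.

    Assumes a Zipfian popularity distribution: share(rank) ~ 1 / rank**alpha.
    """
    weights = [1.0 / (rank ** alpha) for rank in range(1, num_keys + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def imbalance_factor(num_servers, num_keys=100_000, alpha=0.99, seed=0):
    """Return max server load divided by mean server load.

    Keys are placed on servers uniformly at random (a stand-in for hashing),
    with no replication or migration. All parameters are illustrative.
    """
    rng = random.Random(seed)  # fixed seed so the placement is reproducible
    load = [0.0] * num_servers
    for share in zipf_popularity(num_keys, alpha):
        load[rng.randrange(num_servers)] += share
    mean_load = 1.0 / num_servers  # shares sum to 1 across all servers
    return max(load) / mean_load


if __name__ == "__main__":
    for n in (8, 64, 512):
        print(f"{n:4d} servers: imbalance factor {imbalance_factor(n):.2f}")
```

Under these assumptions the hottest key alone accounts for a fixed share of all requests, while the mean load per server shrinks as servers are added, so the server holding that key falls further above the mean as the deployment grows.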

Name: sigmetrics16-skew.pdf
Access type: openaccess
Size: 233.1 KB
Format: Adobe PDF
Checksum (MD5): cf80cf391ba4adee456024d24ca16880