Lock-Unlock: Is That All? A Pragmatic Analysis of Locking in Software Systems

Guerraoui, Rachid; Guiroux, Hugo; Lachaize, Renaud; Quema, Vivien; Trigonakis, Vasileios

doi:10.1145/3301501

research article

Lock-Unlock: Is That All? A Pragmatic Analysis of Locking in Software Systems

Guerraoui, Rachid

•

Guiroux, Hugo

•

Lachaize, Renaud

more

March 1, 2019

Acm Transactions On Computer Systems

A plethora of optimized mutex lock algorithms have been designed over the past 25 years to mitigate performance bottlenecks related to critical sections and locks. Unfortunately, there is currently no broad study of the behavior of these optimized lock algorithms on realistic applications that consider different performance metrics, such as energy efficiency and tail latency. In this article, we perform a thorough and practical analysis of synchronization, with the goal of providing software developers with enough information to design fast, scalable, and energy-efficient synchronization in their systems. First, we perform a performance study of 28 state-of-the-art mutex lock algorithms, on 40 applications, on four different multicore machines. We consider not only throughput (traditionally the main performance metric) but also energy efficiency and tail latency, which are becoming increasingly important. Second, we present an in-depth analysis in which we summarize our findings for all the studied applications. In particular, we describe nine different lock-related performance bottlenecks, and we propose six guidelines helping software developers with their choice of a lock algorithm according to the different lock properties and the application characteristics. From our detailed analysis, we make several observations regarding locking algorithms and application behaviors, several of which have not been previously discovered: (i) applications stress not only the lockunlock interface but also the full locking API (e.g., trylocks, condition variables); (ii) thememory footprint of a lock can directly affect the application performance; (iii) for many applications, the interaction between locks and scheduling is an important application performance factor; (vi) lock tail latencies may or may not affect application tail latency; (v) no single lock is systematically the best; (vi) choosing the best lock is difficult; and (vii) energy efficiency and throughput go hand in hand in the context of lock algorithms. These findings highlight that locking involves more considerations than the simple lock/ unlock interface and call for further research on designing low-memory footprint adaptive locks that fully and efficiently support the full lock interface, and consider all performance metrics.

Type

research article

DOI

10.1145/3301501

Web of Science ID

WOS:000462524200001

Authors

Guerraoui, Rachid

•

Guiroux, Hugo

•

Lachaize, Renaud

•

Quema, Vivien

•

Trigonakis, Vasileios

Publication date

2019-03-01

Published in

Acm Transactions On Computer Systems

Volume

36

Issue

1

Start page

1

Subjects

Computer Science, The...

Computer Science

multicore

synchronization

locks

performance bottlenec...

lock interface

synchronization

energy

memory

Peer reviewed

REVIEWED

EPFL units

DCL

Available on Infoscience

June 18, 2019

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/157387