Hailstorm: Disaggregated Compute and Storage for Distributed LSM-based Databases

Zwaenepoel, Willy

doi:10.1145/3373376.3378504

conference paper

Hailstorm: Disaggregated Compute and Storage for Distributed LSM-based Databases

Bindschaedler, Laurent

•

Goel, Ashvin

•

Zwaenepoel, Willy

March 16, 2020

Proceedings of the 25th ACM International Conference on Architectural Support for Programming Languages and Operating Systems

25th ACM International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS ’20

Distributed LSM-based databases face throughput and latency issues due to load imbalance across instances and interference from background tasks such as flushing, compaction, and data migration. Hailstorm addresses these problems by deploying the database storage engines over a distributed filesystem that disaggregates storage from processing, enabling storage pooling and compaction offloading. Hailstorm pools storage devices within a rack, allowing each storage engine to fully utilize the aggregate rack storage capacity and bandwidth. Storage pooling successfully handles load imbalance without the need for resharding. Hailstorm offloads compaction tasks to remote nodes, distributing their impact, and improving overall system throughput and response time. We show that Hailstorm achieves load balance in many MongoDB deployments with skewed workloads, improving the average throughput by 60%, while decreasing tail latency by as much as 5X. In workloads with range queries, Hailstorm provides up to 22X throughput improvements. Hailstorm also enables cost savings of 47-56% in OLTP workloads.

Name

hailstorm-bindschaedler.pdf

Type

Publisher's Version

Version

http://purl.org/coar/version/c_970fb48d4fbd8a85

Access type

openaccess

Size

1007.52 KB

Format

Adobe PDF

Checksum (MD5)

589e4c7c44dcfa8a068aef3cdea54dc8