Scalable Multi-Query Execution using Reinforcement Learning

Sioulas, Panagiotis; Ailamaki, Anastasia

doi:10.1145/3448016.3452799

conference paper

Scalable Multi-Query Execution using Reinforcement Learning

Sioulas, Panagiotis

•

Ailamaki, Anastasia

2021

Proceedings of the 2021 International Conference on Management of Data SIGMOD '21

ACM SIGMOD International Conference on Management of Data

The growing demand for data-intensive decision support and the migration to multi-tenant infrastructures put databases under the stress of high analytical query load. The requirement for high throughput contradicts the traditional design of query-at-a-time databases that optimize queries for efficient serial execution. Sharing work across queries presents an opportunity to reduce the total cost of processing and therefore improve throughput with increasing query load. Systems can share work either by assessing all opportunities and restructuring batches of queries ahead of execution, or by inspecting opportunities in individual incoming queries at runtime: the former strategy scales poorly to large query counts, as it requires expensive sharing-aware optimization, whereas the latter detects only a subset of the opportunities. Both strategies fail to minimize the cost of processing for large and ad-hoc workloads. This paper presents RouLette, a specialized intelligent engine for multi-query execution that addresses, through runtime adaptation, the shortcomings of existing work-sharing strategies. RouLette scales by replacing sharing-aware optimization with adaptive query processing, and it chooses opportunities to explore and exploit by using reinforcement learning. RouLette also includes optimizations that reduce the adaptation overhead. RouLette increases throughput by 1.6-28.3x, compared to a state-of-the-art query-at-a-time engine, and up to 6.5x, compared to sharing-enabled prototypes, for multi-query workloads based on the schema of TPC-DS.

Type

conference paper

DOI

10.1145/3448016.3452799

Author(s)

Sioulas, Panagiotis

Ailamaki, Anastasia

Date Issued

2021

Publisher

Association for Computing Machinery

Publisher place

New York

Published in

Proceedings of the 2021 International Conference on Management of Data SIGMOD '21

ISBN of the book

978-1-450383-43-1

Total of pages

13

Subjects

sharing

•

multi-query optimization

•

join

•

reinforcement learning

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

DIAS

Event name	Event place	Event date
ACM SIGMOD International Conference on Management of Data	Virtual, China	June 20-25, 2021

Available on Infoscience

April 22, 2021

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/177464