Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Scalable Multi-Query Execution using Reinforcement Learning
 
conference paper

Scalable Multi-Query Execution using Reinforcement Learning

Sioulas, Panagiotis  
•
Ailamaki, Anastasia  
2021
Proceedings of the 2021 International Conference on Management of Data SIGMOD '21
ACM SIGMOD International Conference on Management of Data

The growing demand for data-intensive decision support and the migration to multi-tenant infrastructures put databases under the stress of high analytical query load. The requirement for high throughput contradicts the traditional design of query-at-a-time databases that optimize queries for efficient serial execution. Sharing work across queries presents an opportunity to reduce the total cost of processing and therefore improve throughput with increasing query load. Systems can share work either by assessing all opportunities and restructuring batches of queries ahead of execution, or by inspecting opportunities in individual incoming queries at runtime: the former strategy scales poorly to large query counts, as it requires expensive sharing-aware optimization, whereas the latter detects only a subset of the opportunities. Both strategies fail to minimize the cost of processing for large and ad-hoc workloads. This paper presents RouLette, a specialized intelligent engine for multi-query execution that addresses, through runtime adaptation, the shortcomings of existing work-sharing strategies. RouLette scales by replacing sharing-aware optimization with adaptive query processing, and it chooses opportunities to explore and exploit by using reinforcement learning. RouLette also includes optimizations that reduce the adaptation overhead. RouLette increases throughput by 1.6-28.3x, compared to a state-of-the-art query-at-a-time engine, and up to 6.5x, compared to sharing-enabled prototypes, for multi-query workloads based on the schema of TPC-DS.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1145/3448016.3452799
Author(s)
Sioulas, Panagiotis  
Ailamaki, Anastasia  
Date Issued

2021

Publisher

Association for Computing Machinery

Publisher place

New York

Published in
Proceedings of the 2021 International Conference on Management of Data SIGMOD '21
ISBN of the book

978-1-450383-43-1

Total of pages

13

Subjects

sharing

•

multi-query optimization

•

join

•

reinforcement learning

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DIAS  
Event nameEvent placeEvent date
ACM SIGMOD International Conference on Management of Data

Virtual, China

June 20-25, 2021

Available on Infoscience
April 22, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/177464
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés