Scaling up Mixed Workloads: a Battle of Data Freshness, Flexibility, and Scheduling

Psaroudakis, Iraklis; Wolf, Florian; May, Norman; Neumann, Thomas; Böhm, Alexander; Ailamaki, Anastasia; Sattler, Kai-Uwe

doi:10.1007/978-3-319-15350-6_7

2015

Abstract

The common "one size does not fit all" paradigm isolates transactional and analytical workloads into separate, specialized database systems. Operational data is periodically replicated to a data warehouse for analytics. Competitiveness of enterprises today, however, depends on real-time reporting on operational data, necessitating an integration of transactional and analytical processing in a single database system. The mixed workload should be able to query and modify common data in a shared schema. The database needs to provide performance guarantees for transactional workloads and, at the same time, efficiently evaluate complex analytical queries. In this paper, we share our analysis of the performance of two main-memory databases that support mixed workloads, SAP HANA and HyPer, while evaluating the mixed workload CH-benCHmark. By examining their similarities and differences, we identify the factors that affect performance while scaling the number of concurrent transactional and analytical clients. The three main factors are (a) data freshness, i.e., how recent the data processed by analytical queries is, (b) flexibility, i.e., restricting transactional features in order to increase optimization choices and enhance performance, and (c) scheduling, i.e., how the mixed workload utilizes resources. Specifically for scheduling, we show that the absence of workload management under high concurrency leads to analytical workloads overwhelming the system and severely hurting the performance of transactional workloads.

Details

Title Scaling up Mixed Workloads: a Battle of Data Freshness, Flexibility, and Scheduling

Author(s) Psaroudakis, Iraklis ; Wolf, Florian ; May, Norman ; Neumann, Thomas ; Böhm, Alexander ; Ailamaki, Anastasia ; Sattler, Kai-Uwe

Published in Performance Characterization and Benchmarking. Traditional to Big Data. TPCTC 2014

Series Lecture Notes in Computer Science, 8904

Pages 97-112

Conference Sixth TPC Technology Conference on Performance Evaluation & Benchmarking (TPCTC 2014), Hangzhou, China, September 1-5, 2014

Date 2015

Publisher Springer International Publishing AG

ISBN 978-3-319-15350-6; 978-3-319-15349-0

Keywords

OLAP; OLTP; CH-benCHmark; SAP HANA; HyPer; data freshness; flexibility; scheduling; workload management

Note Published by: Springer International Publishing AG. Benchmark source code can be found in the referenced URL.

DOI https://doi.org/10.1007/978-3-319-15350-6_7

Additional link URL

Laboratories DIAS

Record creation date 2014-09-25
