Formal Approaches to Querying Big Data in Shared-Nothing Systems

Ketsman, Bas

doi:10.1145/3299869.3328524

Ketsman, Bas

2019

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

To meet today's data management needs, it is a widespread practice to use distributed data storage and processing systems. Since the publication of the MapReduce paradigm, a plethora of such systems arose, but although widespread, the capabilities of these systems are still poorly understood and putting them to effective use is often more of an art than a science.

As one of the causes for this observation, we identify a lack of theoretical underpinnings for these systems, which makes it hard to understand what the advantages and disadvantages of the particular systems are and which, in addition, complicates the choice of a particular formalism for a particular task. In my PhD thesis, we zoom in on several important aspects of query evaluation using clusters of servers, including coordination and communication, data-skew, load balancing, and data partitioning, and propose a set of elegant and theoretically sound frameworks and theories that help to understand the applicable limitations and trade-offs.

Details

Title Formal Approaches to Querying Big Data in Shared-Nothing Systems

Author(s) Ketsman, Bas

Published in Sigmod '19: Proceedings Of The 2019 International Conference On Management Of Data

Series International Conference on Management of Data

Pages 1115-1116

Conference ACM SIGMOD International Conference on Management of Data (SIGMOD), Jun 30-Jul 05, 2019, Amsterdam, NETHERLANDS

Date 2019-01-01

Publisher New York, ASSOC COMPUTING MACHINERY

ISSN 0730-8078

ISBN 978-1-4503-5643-5

DOI https://doi.org/10.1145/3299869.3328524

Other identifier(s) View record in Web of Science

Laboratories DATA

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > DATA - Data Analysis Theory and Applications Laboratory
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2019-12-26