Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Alpine: Efficient In-Situ Data Exploration in the Presence of Updates
 
conference paper

Alpine: Efficient In-Situ Data Exploration in the Presence of Updates

Anagnostou, Antonios
•
Olma, Matthaios
•
Ailamaki, Anastasia
May 9, 2017
Proceeding SIGMOD '17 Proceedings of the 2017 ACM International Conference on Management of Data
SIGMOD International Conference on Management of Data

The ever growing data collections create the need for brief explorations of the available data to extract relevant information before decision making becomes necessary. In this context of data exploration, current data analysis solutions struggle to quickly pinpoint useful information in data collections. One major reason is that loading data in a DBMS without knowing which part of it will actually be useful is a major bottleneck. To remove this bottleneck, state-of-the art approaches perform queries in situ, thus avoiding the loading overhead. In situ query engines, however, are index-oblivious, and lack sophisticated techniques to reduce the amount of data to be accessed. Furthermore, applications constantly generate fresh data and update the existing raw data files whereas state-of-the art in situ approaches support only append-like workloads. In this demonstration, we showcase the efficiency of adaptive indexing and partitioning techniques for analytical queries in the presence of updates. We demonstrate an online partitioning and indexing tuner for in situ querying which plugs to a query engine and offers support for fast queries over raw data files. We present Alpine, our prototype implementation, which combines the tuner with a query executor incorporating in situ query techniques to provide efficient raw data access. We will visually demonstrate how Alpine incrementally and adaptively builds auxiliary data structures and indexes over raw data files and how it adapts its behavior as a side-effect of updates in the raw data files.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1145/3035918.3058743
Author(s)
Anagnostou, Antonios
Olma, Matthaios
Ailamaki, Anastasia
Date Issued

2017-05-09

Publisher

ACM New York, NY, USA

Published in
Proceeding SIGMOD '17 Proceedings of the 2017 ACM International Conference on Management of Data
ISBN of the book

978-1-4503-4197-4

Total of pages

4

Start page

1651

End page

1654

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DIAS  
Event nameEvent placeEvent date
SIGMOD International Conference on Management of Data

Chicago, Illinois, USA

May 14 - 19, 2017

Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/148225
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés