A FFINITY: Efficiently Querying Statistical Measures on Time-Series Data

Sathe, Saket; Aberer, Karl

doi:10.1109/ICDE.2013.6544879

Sathe, Saket; Aberer, Karl

2013

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Computing statistical measures for large databases of time series is a fundamental primitive for querying and mining time-series data [1]–[6]. This primitive is gaining importance with the increasing number and rapid growth of time series databases. In this paper, we introduce a framework for efficient computation of statistical measures by exploiting the concept of affine relationships. Affine relationships can be used to infer statistical measures for time series, from other related time series, instead of computing them directly; thus, reducing the overall computational cost significantly. The resulting methods exhibit at least one order of magnitude improvement over the best known methods. To the best of our knowledge, this is the first work that presents an unified approach for computing and querying several statistical measures at once. Our approach exploits affine relationships using three key components. First, the AFCLST algorithm clusters the time-series data, such that high-quality affine relationships could be easily found. Second, the SYMEX algorithm uses the clustered time series and efficiently computes the desired affine relationships. Third, the SCAPE index structure produces a many-fold im- provement in the performance of processing several statistical queries by seamlessly indexing the affine relationships. Finally, we establish the effectiveness of our approaches by performing comprehensive experimental evaluation on real datasets.

Details

Title A FFINITY: Efficiently Querying Statistical Measures on Time-Series Data

Author(s) Sathe, Saket ; Aberer, Karl

Published in 2013 IEEE 29Tth International Conference On Data Engineering (ICDE)

Pages 841-852

Conference 29th International Conference on Data Engineering (ICDE), Brisbane, Australia, April 8-12, 2013

Date 2013

Publisher New York, IEEE

ISBN 978-1-4673-4909-3

DOI https://doi.org/10.1109/ICDE.2013.6544879

Other identifier(s) View record in Web of Science

Laboratories LSIR

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LSIR - Distributed Information Systems Laboratory
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2012-12-03

Files

Abstract

Details

PDF