What-if Analysis with Conflicting Goals: Recommending Data Ranges for Exploration

Nguyen Quoc Viet Hung; Zheng, Kai; Weidlich, Matthias; Zheng, Bolong; Yin, Hongzhi; Nguyen Thanh Tam; Stantic, Bela

doi:10.1109/ICDE.2018.00018

2018

Abstract

What-if analysis is a data-intensive exploration to inspect how changes in a set of input parameters of a model influence some outcomes. It is motivated by a user trying to understand the sensitivity of a model to a certain parameter in order to reach a set of goals that are defined over the outcomes. To avoid an exploration of all possible combinations of parameter values, efficient what-if analysis calls for a partitioning of parameter values into data ranges and a unified representation of the obtained outcomes per range. Traditional techniques to capture data ranges, such as histograms, are limited to one outcome dimension. Yet, in practice, what-if analysis often involves conflicting goals that are defined over different dimensions of the outcome. Working on each of those goals independently cannot capture the inherent trade-off between them. In this paper, we propose techniques to recommend data ranges for what-if analysis, which capture not only data regularities, but also the trade-off between conflicting goals. Specifically, we formulate a parametric data partitioning problem and propose a method to find an optimal solution for it. Targeting scalability to large datasets, we further provide a heuristic solution to this problem. By theoretical and empirical analyses, we establish performance guarantees in terms of runtime and result quality.
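As a rough illustration of the idea in the abstract, the sketch below partitions one input parameter into ranges, summarizes two conflicting outcomes per range, and keeps the ranges that no other range beats on both goals. It is not the method proposed in the paper: the equal-width partitioning, the mean as the per-range summary, the Pareto filter, the function names, and the sample data are all assumptions chosen for illustration.

```python
from statistics import mean


def summarize_ranges(points, num_ranges):
    """Split parameter values into equal-width ranges and average both outcomes.

    points: list of (parameter, outcome_a, outcome_b) tuples (hypothetical layout).
    Returns (low, high, mean_a, mean_b) for every non-empty range.
    """
    params = [p for p, _, _ in points]
    lo, hi = min(params), max(params)
    # Fall back to width 1.0 when all parameter values are identical.
    width = (hi - lo) / num_ranges or 1.0
    buckets = [[] for _ in range(num_ranges)]
    for p, a, b in points:
        # The maximum parameter value is clamped into the last range.
        idx = min(int((p - lo) / width), num_ranges - 1)
        buckets[idx].append((a, b))
    summary = []
    for i, bucket in enumerate(buckets):
        if bucket:
            summary.append((
                lo + i * width,
                lo + (i + 1) * width,
                mean(a for a, _ in bucket),
                mean(b for _, b in bucket),
            ))
    return summary


def pareto_ranges(summary):
    """Keep ranges not dominated by another range.

    Assumed goals: maximize mean_a and minimize mean_b.
    """
    def dominates(x, y):
        no_worse = x[2] >= y[2] and x[3] <= y[3]
        strictly_better = x[2] > y[2] or x[3] < y[3]
        return no_worse and strictly_better

    return [r for r in summary if not any(dominates(o, r) for o in summary)]


# Made-up data: (price, revenue to maximize, churn to minimize).
points = [(1, 10, 1), (2, 12, 2), (5, 20, 5), (6, 22, 6), (9, 15, 9), (10, 13, 10)]
summary = summarize_ranges(points, num_ranges=3)
recommended = pareto_ranges(summary)
for low, high, rev, churn in recommended:
    print(f"range [{low}, {high}]: mean revenue {rev}, mean churn {churn}")
```

With this sample data, the highest price range is dropped because the middle range has both higher mean revenue and lower mean churn, while the low and middle ranges remain as a genuine trade-off. Optimizing either goal alone would hide that trade-off, which is the limitation of single-dimension summaries that the abstract points out.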

Details

Title What-if Analysis with Conflicting Goals: Recommending Data Ranges for Exploration

Author(s) Nguyen Quoc Viet Hung ; Zheng, Kai ; Weidlich, Matthias ; Zheng, Bolong ; Yin, Hongzhi ; Nguyen Thanh Tam ; Stantic, Bela

Published in 2018 IEEE 34th International Conference on Data Engineering (ICDE)

Series IEEE International Conference on Data Engineering

Pages 89-100

Conference 34th IEEE International Conference on Data Engineering Workshops (ICDEW), Paris, France, Apr 16-19, 2018

Date 2018-01-01

Publisher New York, IEEE

ISSN 1084-4627

ISBN 978-1-5386-5520-7

DOI https://doi.org/10.1109/ICDE.2018.00018

Laboratories LSIR

Record creation date 2019-11-10