Constrained Binary Identification Problem

Karbasi, Amin; Zadimoghaddam, Morteza

2013

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We consider the problem of building a binary decision tree, to locate an object within a set by way of the least number of membership queries. This problem is equivalent to the “20 questions game” of information theory and is closely related to lossless source compression. If any query is admissible, Huffman coding is optimal with close to H[P] questions on average, the entropy of the prior distribution P over objects. However, in many realistic scenarios, there are constraints on which queries can be asked, and solving the problem optimally is NP-hard. We provide novel polynomial time approximation algorithms where constraints are defined in terms of “graph", general “cost", and “submodular" functions. In particular, we show that under graph constraints, there exists a constant approximation algorithm for locating the target in the set. We then extend our approach for scenarios where the constraints are defined in terms of general cost functions that depend only on the size of the query and provide an approxima- tion algorithm that can find the target within O(log(log n)) gap from the cost of the optimum algorithm. Submodular functions come as a natural generalization of cost functions with decreas- ing marginals. Under submodular set constraints, we devise an approximation algorithm that can find the target within O(log n) gap from the cost of the optimum algorithm. The proposed algorithms are greedy in a sense that at each step they select a query that most evenly splits the set without violating the underlying constraints. These results can be applied to network tomography, active learning and interactive content search.

Details

Title Constrained Binary Identification Problem

Author(s) Karbasi, Amin ; Zadimoghaddam, Morteza

Published in Proceedings of the 30th Symposium on Theoretical Aspects of Computer Science

Conference 30th Symposium on Theoretical Aspects of Computer Science, Kiel, Germany

Date 2013

Keywords

Graph Algorithms and Network Problems; Analysis of Approximation Algorithms; Information Search and Retrieval

Laboratories LCAV
LTHC

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LCAV - Audio Visual Communications Laboratory
Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LTHC - Communication Theories Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2013-01-29

Actions

Preview

Select file: