Abstract

In this work, we study graph-based multi-armed bandit (MAB) problems aimed at optimizing actions on irregular and high-dimensional graphs. More formally, we consider a decision-maker that takes sequential actions over time and observes the experienced reward, defined as a function of a sparse graph signal. The goal is to learn the action policy that maximizes the reward accumulated over time. The main challenges are the system uncertainty (i.e., the unknown parameters of the sparse graph signal model) and the high-dimensional search space. The uncertainty can be addressed by online learning strategies that infer the system dynamics while taking the appropriate actions; however, high dimensionality makes such strategies highly data-inefficient. To overcome this limitation, we propose a novel graph-based MAB algorithm that remains data-efficient even in high-dimensional systems. The key intuition is to infer the nature of the graph processes by learning in the graph-spectral domain, and to exploit this knowledge while optimizing the actions. In particular, we model the graph signal with a sparse dictionary-based representation, and we propose an online sequential decision strategy that learns the parameters of the graph processes while optimizing the action strategy.
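To make the high-level loop concrete, the following is a minimal, hypothetical sketch (not the paper's actual algorithm) of a spectral bandit: the eigenvectors of the graph Laplacian serve as the dictionary, the unknown reward signal is generated by sparse spectral coefficients, and the learner alternates between picking a node optimistically (UCB-style bonus) and refitting the coefficients from the rewards observed so far. The function name, the ridge refit (a stand-in for the sparsity-promoting estimator the abstract implies), and all parameter values are illustrative assumptions.

```python
import numpy as np

def spectral_bandit_sketch(L, alpha_true, horizon, noise=0.1, seed=0):
    """Hypothetical sketch of a graph-spectral MAB loop.

    L          : graph Laplacian (n x n); its eigenvectors form the dictionary.
    alpha_true : sparse spectral coefficients generating the (unknown) reward.
    Each round: pick a node (arm) optimistically, observe a noisy sample of
    the graph signal there, and refit the spectral coefficients.
    """
    rng = np.random.default_rng(seed)
    n = L.shape[0]
    _, U = np.linalg.eigh(L)       # spectral dictionary (graph Fourier basis)
    signal = U @ alpha_true        # true per-node reward, unknown to learner
    X, y, picks = [], [], []
    alpha_hat = np.zeros(n)
    for t in range(horizon):
        est = U @ alpha_hat        # predicted reward at every node
        counts = np.bincount(np.asarray(picks, dtype=int), minlength=n)
        bonus = np.sqrt(2.0 * np.log(t + 2) / (counts + 1))  # exploration
        arm = int(np.argmax(est + bonus))
        picks.append(arm)
        r = signal[arm] + noise * rng.standard_normal()
        X.append(U[arm]); y.append(r)
        A, b = np.array(X), np.array(y)
        # Ridge refit of the spectral coefficients; the abstract instead
        # suggests a sparsity-promoting fit -- this is a simple stand-in.
        alpha_hat = np.linalg.solve(A.T @ A + 0.1 * np.eye(n), A.T @ b)
    return picks, alpha_hat
```

Learning in the spectral domain is what keeps the sketch data-efficient: when the signal is sparse in the dictionary, far fewer samples than nodes suffice to pin down the reward landscape.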

Details

Actions