Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. No-Regret Learning in Unknown Games with Correlated Payoffs
 
conference paper

No-Regret Learning in Unknown Games with Correlated Payoffs

Sessa, Pier Giuseppe
•
Bogunovic, Ilija
•
Kamgarpour, Maryam  
Show more
2019
Advances in Neural Information Processing Systems 32 (NeurIPS 2019)
Advances in Neural Information Processing System

We consider the problem of learning to play a repeated multi-agent game with an unknown reward function. Single player online learning algorithms attain strong regret bounds when provided with full information feedback, which unfortunately is unavailable in many real-world scenarios. Bandit feedback alone, i.e., observing outcomes only for the selected action, yields substantially worse performance. In this paper, we consider a natural model where, besides a noisy measurement of the obtained reward, the player can also observe the opponents' actions. This feedback model, together with a regularity assumption on the reward function, allows us to exploit the correlations among different game outcomes by means of Gaussian processes (GPs). We propose a novel confidence-bound based bandit algorithm GP-MW, which utilizes the GP model for the reward function and runs a multiplicative weight (MW) method. We obtain novel kernel-dependent regret bounds that are comparable to the known bounds in the full information setting, while substantially improving upon the existing bandit results. We experimentally demonstrate the effectiveness of GP-MW in random matrix games, as well as real-world problems of traffic routing and movie recommendation. In our experiments, GP-MW consistently outperforms several baselines, while its performance is often comparable to methods that have access to full information feedback.

  • Details
  • Metrics
Type
conference paper
Author(s)
Sessa, Pier Giuseppe
Bogunovic, Ilija
Kamgarpour, Maryam  
Krause, Andreas
Date Issued

2019

Publisher

Curran Associates, Inc.

Publisher place

Vancouver

Published in
Advances in Neural Information Processing Systems 32 (NeurIPS 2019)
Volume

32

URL
https://proceedings.neurips.cc/paper/2019/file/685217557383cd194b4f10ae4b39eebf-Paper.pdf
Editorial or Peer reviewed

REVIEWED

Written at

OTHER

EPFL units
SYCAMORE  
Event nameEvent placeEvent date
Advances in Neural Information Processing System

Vancouver

2019

Available on Infoscience
December 1, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/183335
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés