Let’s be honest: An optimal no-regret framework for zero-sum games

Asadi Kangarshahi, Ehsan; Hsieh, Ya-Ping; Sahin, Mehmet Fatih; Cevher, Volkan

2018

Abstract

We revisit the problem of solving two-player zero-sum games in the decentralized setting. We propose a simple algorithmic framework that simultaneously achieves the best rates for honest regret as well as adversarial regret, and in addition resolves the open problem of removing the logarithmic terms in convergence to the value of the game. We achieve this goal in three steps. First, we provide a novel analysis of optimistic mirror descent (OMD), showing that it can be modified to guarantee fast convergence for both honest regret and the value of the game when the players are playing collaboratively. Second, we propose a new algorithm, dubbed robust optimistic mirror descent (ROMD), which attains optimal adversarial regret without knowing the time horizon beforehand. Finally, we propose a simple signaling scheme, which enables us to bridge OMD and ROMD to achieve the best of both worlds. Numerical examples are presented to support our theoretical claims and show that our non-adaptive ROMD algorithm can be competitive with OMD with adaptive step-size selection.
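For readers unfamiliar with the setting, the following is a minimal sketch of the kind of dynamics the abstract refers to: two players running optimistic mirror descent with an entropic regularizer (often called optimistic Hedge) on a matrix game. It is not the paper's modified OMD, ROMD, or signaling scheme. The function names, the fixed step size `eta`, the horizon `T`, and the example payoff matrix are all assumptions made for illustration.

```python
import math


def _normalize(weights):
    """Rescale nonnegative weights so they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]


def optimistic_hedge(A, eta=0.1, T=2000):
    """Illustrative optimistic Hedge for the game min_x max_y x^T A y.

    Assumptions (not from the paper): fixed step size eta, entropic
    regularizer, and the previous gradient used as the prediction.
    Returns the time-averaged strategies of both players.
    """
    n, m = len(A), len(A[0])
    x = [1.0 / n] * n          # row player (minimizer), uniform start
    y = [1.0 / m] * m          # column player (maximizer), uniform start
    gx_prev = [0.0] * n        # previous loss vector seen by the row player
    gy_prev = [0.0] * m        # previous payoff vector seen by the column player
    x_sum = [0.0] * n
    y_sum = [0.0] * m
    for _ in range(T):
        x_sum = [s + xi for s, xi in zip(x_sum, x)]
        y_sum = [s + yj for s, yj in zip(y_sum, y)]
        # Each player observes only its own feedback vector (decentralized).
        gx = [sum(A[i][j] * y[j] for j in range(m)) for i in range(n)]
        gy = [sum(A[i][j] * x[i] for i in range(n)) for j in range(m)]
        # Optimistic step: use 2 * current - previous feedback.
        x = _normalize([x[i] * math.exp(-eta * (2 * gx[i] - gx_prev[i]))
                        for i in range(n)])
        y = _normalize([y[j] * math.exp(eta * (2 * gy[j] - gy_prev[j]))
                        for j in range(m)])
        gx_prev, gy_prev = gx, gy
    return [s / T for s in x_sum], [s / T for s in y_sum]


def duality_gap(A, x, y):
    """Best-response payoff against x minus best-response loss against y."""
    n, m = len(A), len(A[0])
    best_col = max(sum(A[i][j] * x[i] for i in range(n)) for j in range(m))
    best_row = min(sum(A[i][j] * y[j] for j in range(m)) for i in range(n))
    return best_col - best_row


# Hypothetical 2x2 example; its equilibrium is (2/5, 3/5) for both players.
A = [[2.0, -1.0], [-1.0, 1.0]]
x_avg, y_avg = optimistic_hedge(A)
print("duality gap of averaged play:", duality_gap(A, x_avg, y_avg))
```

The duality gap of the averaged strategies is nonnegative and equals zero exactly at a Nash equilibrium, so it serves as a simple convergence check when both players follow the same ("honest") dynamics.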

Details

Title Let’s be honest: An optimal no-regret framework for zero-sum games

Author(s) Asadi Kangarshahi, Ehsan ; Hsieh, Ya-Ping ; Sahin, Mehmet Fatih ; Cevher, Volkan

Published in Proceedings of the 35th International Conference on Machine Learning

Pagination 9

Series Not Applicable

Conference 35th International Conference on Machine Learning (ICML), Stockholm, Sweden, July 10-15, 2018

Date 2018

Keywords

Zero-Sum Games; No-Regret Algorithms; ml-ai

Laboratories LIONS

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIONS - Laboratory for Information and Inference Systems
Peer-reviewed publications
Conference Papers
Work produced at EPFL

Record creation date 2018-02-12
