A new regret analysis for Adam-type algorithms

Alacaoglu, Ahmet; Malitsky, Yura; Mertikopoulos, Panayotis; Cevher, Volkan

conference paper

Alacaoglu, Ahmet

•

Malitsky, Yura

•

Mertikopoulos, Panayotis

more

2020

Proceedings of the 37th International Conference on Machine Learning (ICML)

37th International Conference on Machine Learning (ICLM 2020)

In this paper, we focus on a theory-practice gap for Adam and its variants (AMSgrad, AdamNC, etc.). In practice, these algorithms are used with a constant first-order moment parameter 1 (typically between 0:9 and 0:99). In theory, regret guarantees for online convex optimization require a rapidly decaying 1 ! 0 schedule. We show that this is an artifact of the standard analysis and propose a novel framework that allows us to derive optimal, data-dependent regret bounds with a constant 1, without further assumptions. We also demonstrate the flexibility of our analysis on a wide range of different algorithms and settings.

Name

A new regret analysis.pdf

Type

Preprint

Access type

openaccess

Size

331.76 KB

Format

Adobe PDF

Checksum (MD5)

9ce67ceda9a9cb24125702d69c7ae46c