Error Feedback Fixes SignSGD and other Gradient Compression Schemes

Karimireddy, Sai Praneeth Reddy; Rebjock, Quentin; Stich, Sebastian Urban; Jaggi, Martin

Karimireddy, Sai Praneeth Reddy; Rebjock, Quentin; Stich, Sebastian Urban; Jaggi, Martin

2019

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Résumé

Sign-based algorithms (e.g. signSGD) have been proposed as a biased gradient compression technique to alleviate the communication bottleneck in training large neural networks across multiple workers. We show simple convex counter-examples where signSGD does not converge to the optimum. Further, even when it does converge, signSGD may generalize poorly when compared with SGD. These issues arise because of the biased nature of the sign compression operator. We then show that using error-feedback, i.e. incorporating the error made by the compression operator into the next step, overcomes these issues. We prove that our algorithm (EF-SGD) with arbitrary compression operator achieves the same rate of convergence as SGD without any additional assumptions. Thus EF-SGD achieves gradient compression for free. Our experiments thoroughly substantiate the theory.

Détails

Titre Error Feedback Fixes SignSGD and other Gradient Compression Schemes

Auteur(s) Karimireddy, Sai Praneeth Reddy ; Rebjock, Quentin ; Stich, Sebastian Urban ; Jaggi, Martin

Publié dans Proceedings of the International Conference on Machine Learning, 9-15 June 2019, Long Beach, California, USA

Série Proceedings of Machine Learning Research, 97

Volume 97

Pages 3252-3261

Présenté à 36th International Conference on Machine Learning (ICML) 2019, Long Beach, USA, June 9-15, 2019

Date 2019

Editeur PMLR

Mots-clés (libres)

ml-ai

Lien supplémentaire See paper at Publisher's site

Laboratoires MLO

Le document apparaît dans Production scientifique et compétences > I&C - Faculté Informatique & Communications > IINFCOM > MLO - Laboratoire d'apprentissage automatique et d'optimisation
Publications validées par des pairs
Papiers de conférence
Travail produit à l'EPFL

Date de création de la notice 2019-08-30

Files

Résumé

Détails

PDF