Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems
 
conference paper

On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems

Mertikopoulos, Panayotis
•
Hallak, Nadav
•
Kavis, Ali  
Show more
2020
NeurIPS Proceedings
34th Conference on Neural Information Processing Systems (NeurIPS 2020)

This paper analyzes the trajectories of stochastic gradient descent (SGD) to help understand the algorithm’s convergence properties in non-convex problems. We first show that the sequence of iterates generated by SGD remains bounded and converges with probability 1 under a very broad range of step-size schedules. Subsequently, going beyond existing positive probability guarantees, we show that SGD avoids strict saddle points/manifolds with probability 1 for the entire spectrum of step-size policies considered. Finally, we prove that the algorithm’s rate of convergence to local minimizers with a positive-definite Hessian is $O(1/n^p)$ if the method is employed with a $\Theta(1/n^p)$ step-size. This provides an important guideline for tuning the algorithm’s step-size as it suggests that a cool-down phase with a vanishing step-size could lead to faster convergence; we demonstrate this heuristic using ResNet architectures on CIFAR.

  • Files
  • Details
  • Metrics
Type
conference paper
Author(s)
Mertikopoulos, Panayotis
Hallak, Nadav
Kavis, Ali  
Cevher, Volkan
Date Issued

2020

Published in
NeurIPS Proceedings
Total of pages

30

Series title/Series vol.

Advances in Neural Information Processing Systems; 33

Subjects

ml-ai

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIONS  
Event nameEvent placeEvent date
34th Conference on Neural Information Processing Systems (NeurIPS 2020)

Virtual

December 6-12, 2020

Available on Infoscience
November 18, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/183045
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés