Infoscience

conference paper not in proceedings

Phase diagram of Stochastic Gradient Descent in high-dimensional two-layer neural networks

Veiga, Rodrigo; Stephan, Ludovic; Loureiro, Bruno; et al.
2022
36th Conference on Neural Information Processing Systems (NeurIPS 2022)

Despite the non-convex optimization landscape, over-parametrized shallow networks can achieve global convergence under gradient descent. The picture can be radically different for narrow networks, which tend to get stuck in badly generalizing local minima. Here we investigate the crossover between these two regimes in the high-dimensional setting, and in particular the connection between the so-called mean-field/hydrodynamic regime and the seminal approach of Saad & Solla. Focusing on the case of Gaussian data, we study the interplay between the learning rate, the time scale, and the number of hidden units in the high-dimensional dynamics of stochastic gradient descent (SGD). Our work builds on a deterministic description of SGD in high dimensions from statistical physics, which we extend and for which we provide rigorous convergence rates.
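
As a rough illustration of the kind of setting the abstract describes, the sketch below runs one-pass (online) SGD on a two-layer network with Gaussian inputs and tracks the test error as a function of the learning rate `lr`, the number of steps, and the number of hidden units `p`. The teacher-student setup, the tanh activation, the 1/sqrt(d) and 1/p scalings, and every name and default value here are assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np


def online_sgd(d=100, p=4, k=2, lr=1.0, steps=5000, seed=0):
    """One-pass SGD for a two-layer student on Gaussian data (illustrative).

    d: input dimension, p: student hidden units, k: teacher hidden units
    (the teacher network is a hypothetical target, assumed for this sketch).
    Returns the test error recorded every d steps.
    """
    rng = np.random.default_rng(seed)
    W_star = rng.standard_normal((k, d))  # fixed teacher first-layer weights
    W = rng.standard_normal((p, d))       # student first-layer weights

    # Fixed Gaussian test set used to estimate the population risk.
    X_test = rng.standard_normal((2000, d))
    y_test = np.tanh(X_test @ W_star.T / np.sqrt(d)).mean(axis=1)

    def test_error(weights):
        pred = np.tanh(X_test @ weights.T / np.sqrt(d)).mean(axis=1)
        return 0.5 * np.mean((pred - y_test) ** 2)

    errors = [test_error(W)]
    for t in range(1, steps + 1):
        x = rng.standard_normal(d)              # fresh Gaussian sample each step
        pre = W @ x / np.sqrt(d)                # student pre-activations
        y = np.tanh(W_star @ x / np.sqrt(d)).mean()
        err = np.tanh(pre).mean() - y
        # Gradient of 0.5 * err**2 with respect to each student row.
        grad = err * (1.0 - np.tanh(pre) ** 2)[:, None] * x[None, :]
        grad /= p * np.sqrt(d)
        W -= lr * grad
        if t % d == 0:                          # record once per d steps
            errors.append(test_error(W))
    return errors


if __name__ == "__main__":
    errs = online_sgd()
    print(f"initial test error: {errs[0]:.4f}, final: {errs[-1]:.4f}")
```

Varying `lr`, `p`, and `steps` relative to `d` in this sketch is one way to explore informally how the three quantities named in the abstract interact; it does not reproduce the paper's analysis or results.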

Name: NeurIPS-2022-phase-diagram-of-stochastic-gradient-descent-in-high-dimensional-two-layer-neural-networks-Paper-Conference.pdf
Type: n/a
Access type: openaccess
License condition: n/a
Size: 1.74 MB
Format: Adobe PDF
Checksum (MD5): bd014c2285c517479b09906ee5449a52


Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved