Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Rigorous Dynamical Mean-Field Theory for Stochastic Gradient Descent Methods
 
research article

Rigorous Dynamical Mean-Field Theory for Stochastic Gradient Descent Methods

Gerbelot, Cédric
•
Troiani, Emanuele  
•
Mignacco, Francesca
Show more
May 6, 2024
SIAM Journal on Mathematics of Data Science

We prove closed-form equations for the exact high-dimensional asymptotics of a family of first-order gradient-based methods, learning an estimator (e.g., M-estimator, shallow neural network) from observations on Gaussian data with empirical risk minimization. This includes widely used algorithms such as stochastic gradient descent (SGD) or Nesterov acceleration. The obtained equations match those resulting from the discretization of dynamical mean-field theory equations from statistical physics when applied to the corresponding gradient flow. Our proof method allows us to give an explicit description of how memory kernels build up in the effective dynamics and to include nonseparable update functions, allowing datasets with nonidentity covariance matrices. Finally, we provide numerical implementations of the equations for SGD with generic extensive batch size and constant learning rates.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

23m1594388.pdf

Type

Main Document

Version

Published version

Access type

openaccess

License Condition

CC BY

Size

976.91 KB

Format

Adobe PDF

Checksum (MD5)

1767c0e9815b70588f08773a7b0df12a

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés