Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Dynamical mean-field theory for stochastic gradient descent in Gaussian mixture classification*
 
research article

Dynamical mean-field theory for stochastic gradient descent in Gaussian mixture classification*

Mignacco, Francesca
•
Krzakala, Florent  
•
Urbani, Pierfrancesco
Show more
December 1, 2021
Journal Of Statistical Mechanics-Theory And Experiment

We analyze in a closed form the learning dynamics of the stochastic gradient descent (SGD) for a single-layer neural network classifying a high-dimensional Gaussian mixture where each cluster is assigned one of two labels. This problem provides a prototype of a non-convex loss landscape with interpolating regimes and a large generalization gap. We define a particular stochastic process for which SGD can be extended to a continuous-time limit that we call stochastic gradient flow. In the full-batch limit, we recover the standard gradient flow. We apply dynamical mean-field theory from statistical physics to track the dynamics of the algorithm in the high-dimensional limit via a self-consistent stochastic process. We explore the performance of the algorithm as a function of the control parameters shedding light on how it navigates the loss landscape.

  • Details
  • Metrics
Type
research article
DOI
10.1088/1742-5468/ac3a80
Web of Science ID

WOS:000735638900001

Author(s)
Mignacco, Francesca
Krzakala, Florent  
Urbani, Pierfrancesco
Zdeborova, And Lenka  
Date Issued

2021-12-01

Published in
Journal Of Statistical Mechanics-Theory And Experiment
Volume

2021

Issue

12

Article Number

124008

Subjects

Mechanics

•

Physics, Mathematical

•

Physics

•

learning theory

•

machine learning

•

systems

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
IDEPHICS1  
IDEPHICS2  
SPOC2  
Available on Infoscience
January 15, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/184556
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés