Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Dynamical mean-field theory for stochastic gradient descent in Gaussian mixture classification
 
conference paper

Dynamical mean-field theory for stochastic gradient descent in Gaussian mixture classification

Mignacco, Francesca
•
Krzakala, Florent  
•
Urbani, Pierfrancesco
Show more
2020
Proceeding of the 2020 Advances in Neural Information Processing Systems
Advances in Neural Information Processing Systems

We analyze in a closed form the learning dynamics of stochastic gradient descent (SGD) for a single layer neural network classifying a high-dimensional Gaussian mixture where each cluster is assigned one of two labels. This problem provides a prototype of a non-convex loss landscape with interpolating regimes and a large generalization gap. We define a particular stochastic process for which SGD can be extended to a continuous-time limit that we call stochastic gradient flow. In the full-batch limit we recover the standard gradient flow. We apply dynamical mean-field theory from statistical physics to track the dynamics of the algorithm in the high-dimensional limit via a self-consistent stochastic process. We explore the performance of the algorithm as a function of control parameters shedding light on how it navigates the loss landscape

  • Details
  • Metrics
Type
conference paper
Author(s)
Mignacco, Francesca
Krzakala, Florent  
Urbani, Pierfrancesco
Zdeborová, Lenka  
Date Issued

2020

Publisher

Curran Associates, Inc.

Published in
Proceeding of the 2020 Advances in Neural Information Processing Systems
Series title/Series vol.

Advances in Neural Information Processing Systems; 33

Volume

33

Start page

9540

URL

paper

https://papers.nips.cc/paper/2020/file/6c81c83c4bd0b58850495f603ab45a93-Paper.pdf
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
IDEPHICS1  
IDEPHICS2  
SPOC1  
Show more
Event nameEvent date
Advances in Neural Information Processing Systems

Dec 6, 2020 – Dec 12, 2020

Available on Infoscience
March 5, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/175774
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés