Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Comparing dynamics: deep neural networks versus glassy systems
 
research article

Comparing dynamics: deep neural networks versus glassy systems

Baity-Jesi, Marco
•
Sagun, Levent  
•
Geiger, Mario  
Show more
December 1, 2019
Journal Of Statistical Mechanics-Theory And Experiment

We analyze numerically the training dynamics of deep neural networks (DNN) by using methods developed in statistical physics of glassy systems. The two main issues we address are (1) the complexity of the loss landscape and of the dynamics within it, and (2) to what extent DNNs share similarities with glassy systems. Our findings, obtained for different architectures and datasets, suggest that during the training process the dynamics slows down because of an increasingly large number of flat directions. At large times, when the loss is approaching zero, the system diffuses at the bottom of the landscape. Despite some similarities with the dynamics of mean-field glassy systems, in particular, the absence of barrier crossing, we find distinctive dynamical behaviors in the two cases, showing that the statistical properties of the corresponding loss and energy landscapes are different. In contrast, when the network is under-parametrized we observe a typical glassy behavior, thus suggesting the existence of different phases depending on whether the network is under-parametrized or over-parametrized.

  • Details
  • Metrics
Type
research article
DOI
10.1088/1742-5468/ab3281
Web of Science ID

WOS:000510503800012

Author(s)
Baity-Jesi, Marco
Sagun, Levent  
Geiger, Mario  
Spigler, Stefano  
Ben Arpus, Gerard
Cammarpta, Chiara
LeCun, Yann
Wyart, Matthieu  
Biroli, Giulio
Date Issued

2019-12-01

Publisher

IOP PUBLISHING LTD

Published in
Journal Of Statistical Mechanics-Theory And Experiment
Volume

2019

Issue

12

Article Number

124013

Subjects

Mechanics

•

Physics, Mathematical

•

Physics

•

machine learning

•

states

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
PCSL  
Available on Infoscience
February 26, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/166501
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés