Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Comparing Dynamics: Deep Neural Networks versus Glassy Systems
 
conference paper

Comparing Dynamics: Deep Neural Networks versus Glassy Systems

Baity-Jesi, Marco
•
Sagun, Levent Dogus  
•
Geiger, Mario  
Show more
2018
Proceedings of the 35th International Conference on Machine Learning
35th International Conference on Machine Learning (ICML)

We analyze numerically the training dynamics of deep neural networks (DNN) by using methods developed in statistical physics of glassy systems. The two main issues we address are the complexity of the loss-landscape and of the dynamics within it, and to what extent DNNs share similarities with glassy systems. Our findings, obtained for different architectures and data-sets, suggest that during the training process the dynamics slows down because of an increasingly large number of flat directions. At large times, when the loss is approaching zero, the system diffuses at the bottom of the landscape. Despite some similarities with the dynamics of mean-field glassy systems, in particular, the absence of barrier crossing, we find distinctive dynamical behaviors in the two cases, thus showing that the statistical properties of the corresponding loss and energy landscapes are different. In contrast, when the network is under-parametrized we observe a typical glassy behavior, thus suggesting the existence of different phases depending on whether the network is under-parametrized or over-parametrized.

  • Details
  • Metrics
Type
conference paper
Author(s)
Baity-Jesi, Marco
Sagun, Levent Dogus  
Geiger, Mario  
Spigler, Stefano  
Ben Arous, Gérard  
Cammarota, Chiara
LeCun, Yann
Wesche, Rainer  
Biroli, Giulio
Date Issued

2018

Published in
Proceedings of the 35th International Conference on Machine Learning
Series title/Series vol.

Proceedings of Machine Learning Research; 80

Volume

PMLR 80

Start page

314

End page

323

URL

PDF

http://proceedings.mlr.press/v80/baity-jesi18a/baity-jesi18a.pdf
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
PCSL  
Event nameEvent placeEvent date
35th International Conference on Machine Learning (ICML)

Stockholm, Sweden

July 10-15, 2018

Available on Infoscience
August 27, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/160662
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés