Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Polynomial-time universality and limitations of deep learning
 
research article

Polynomial-time universality and limitations of deep learning

Abbe, Emmanuel  
•
Sandon, Colin
June 30, 2023
Communications On Pure And Applied Mathematics

The goal of this paper is to characterize function distributions that general neural networks trained by descent algorithms (GD/SGD), can or cannot learn in polytime. The results are: (1) The paradigm of general neural networks trained by SGD is poly-time universal: any function distribution that can be learned from samples in polytime can also be learned by a poly-size neural net trained by SGD with polynomial parameters. In particular, this can be achieved despite polynomial noise on the gradients, implying a separation result between SGD-based deep learning and statistical query algorithms, as the latter are not comparably universal due to cases like parities. This also shows that deep learning does not suffer from the limitations of shallow networks. (2) The paper further gives a lower-bound on the generalization error of descent algorithms, which relies on two quantities: the cross-predictability, an average-case quantity related to the statistical dimension, and the null-flow, a quantity specific to descent algorithms. The lower-bound implies in particular that for functions of low enough cross-predictability, the above robust universality breaks down once the gradients are averaged over too many samples (as in perfect GD) rather than fewer (as in SGD). (3) Finally, it is shown that if larger amounts of noise are added on the initialization and on the gradients, then SGD is no longer comparably universal due again to distributions having low enough cross-predictability.

  • Details
  • Metrics
Type
research article
DOI
10.1002/cpa.22121
Web of Science ID

WOS:001017636800001

Author(s)
Abbe, Emmanuel  
Sandon, Colin
Date Issued

2023-06-30

Publisher

WILEY

Published in
Communications On Pure And Applied Mathematics
Subjects

Mathematics, Applied

•

Mathematics

•

Mathematics

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

Available on Infoscience
July 17, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/199128
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés