Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. An Initial Alignment between Neural Network and Target is Needed for Gradient Descent to Learn
 
conference paper

An Initial Alignment between Neural Network and Target is Needed for Gradient Descent to Learn

Abbe, Emmanuel  
•
Cornacchia, Elisabetta  
•
Hazla, Jan  
Show more
January 1, 2022
International Conference On Machine Learning, Vol 162
38th International Conference on Machine Learning (ICML)

This paper introduces the notion of "Initial Alignment" (INAL) between a neural network at initialization and a target function. It is proved that if a network and a Boolean target function do not have a noticeable INAL, then noisy gradient descent on a fully connected network with normalized i.i.d. initialization will not learn in polynomial time. Thus a certain amount of knowledge about the target (measured by the INAL) is needed in the architecture design. This also provides an answer to an open problem posed in (Abbe & Sandon, 2020a). The results are based on deriving lower bounds for descent algorithms on symmetric neural networks without explicit knowledge of the target function beyond its INAL.

  • Details
  • Metrics
Type
conference paper
Web of Science ID

WOS:000899944900003

Author(s)
Abbe, Emmanuel  
Cornacchia, Elisabetta  
Hazla, Jan  
Marquis, Christopher
Date Issued

2022-01-01

Publisher

JMLR-JOURNAL MACHINE LEARNING RESEARCH

Publisher place

San Diego

Published in
International Conference On Machine Learning, Vol 162
Series title/Series vol.

Proceedings of Machine Learning Research

Start page

33

End page

52

Subjects

Computer Science, Artificial Intelligence

•

Computer Science

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

Event nameEvent placeEvent date
38th International Conference on Machine Learning (ICML)

Baltimore, MD

Jul 17-23, 2022

Available on Infoscience
March 13, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/195715
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés