Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Asymptotics of feature learning in two-layer networks after one gradient-step
 
conference paper

Asymptotics of feature learning in two-layer networks after one gradient-step

Cui, Hugo Chao  
•
Pesce, Luca  
•
Dandi, Yatin  
Show more
2024
ICML'24: Proceedings of the 41st International Conference on Machine Learning
41st International Conference on Machine Learning (ICML) 2024

In this manuscript, we investigate the problem of how two-layer neural networks learn features from data, and improve over the kernel regime, after being trained with a single gradient descent step. Leveraging the insight from (Ba et al., 2022), we model the trained network by a spiked Random Features (sRF) model. Further building on recent progress on Gaussian universality (Dandi et al., 2023), we provide an exact asymptotic description of the generalization error of the sRF in the high-dimensional limit where the number of samples, the width, and the input dimension grow at a proportional rate. The resulting characterization for sRFs also captures closely the learning curves of the original network model. This enables us to understand how adapting to the data is crucial for the network to efficiently learn non-linear functions in the direction of the gradient - where at initialization it can only express linear functions in this regime.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

cui24d.pdf

Type

Main Document

Version

Published version

Access type

openaccess

License Condition

N/A

Size

673.92 KB

Format

Adobe PDF

Checksum (MD5)

dac80b8093ea21ad9e25da5b41004a1c

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés