Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Solvable Model for Inheriting the Regularization through Knowledge Distillation
 
conference paper

Solvable Model for Inheriting the Regularization through Knowledge Distillation

Saglietti, Luca  
•
Zdeborová, Lenka  
December 16, 2021
Proceedings of the 2nd Mathematical and Scientific Machine Learning Conference
2nd Mathematical and Scientific Machine Learning Conference

In recent years the empirical success of transfer learning with neural networks has stimulated an increasing interest in obtaining a theoretical understanding of its core properties. Knowledge Dis- tillation where a smaller neural network is trained using the outputs of a larger neural network is a particularly interesting case of transfer learning. In the present work, we introduce a statistical physics framework that allows an analytic characterization of the properties of knowledge distil- lation (KD) in shallow neural networks. Focusing the analysis on a solvable model that exhibits a non-trivial generalization gap, we investigate the effectiveness of KD. We are able to show that, through KD, the regularization properties of the larger teacher model can be inherited by the smaller student and that the yielded generalization performance is closely linked to and limited by the op- timality of the teacher. Finally, we analyze the double descent phenomenology that can arise in the considered KD setting.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

saglietti22a_Solvable Model for Inheriting the Regularization through Knowledge Distillation.pdf

Type

N/a

Access type

openaccess

License Condition

n/a

Size

608.17 KB

Format

Adobe PDF

Checksum (MD5)

4f5ca34fb1c1153161b7f4f5bf79e7a7

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés