Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Are Gaussian data all you need? the extents and limits of universality in high-dimensional generalized linear estimation
 
conference paper

Are Gaussian data all you need? the extents and limits of universality in high-dimensional generalized linear estimation

Pesce, Luca  
•
Krzakala, Florent  
•
Loureiro, Bruno
Show more
2023
International Conference on Machine Learning, 23-29 July 2023, Honolulu, Hawaii, USA
International Conference on Machine Learning 2023

In this manuscript we consider the problem of generalized linear estimation on Gaussian mixture data with labels given by a single-index model. Our first result is a sharp asymptotic expression for the test and training errors in the high-dimensional regime. Motivated by the recent stream of results on the Gaussian universality of the test and training errors in generalized linear estimation, we ask ourselves the question: "when is a single Gaussian enough to characterize the error?". Our formulas allow us to give sharp answers to this question, both in the positive and negative directions. More precisely, we show that the sufficient conditions for Gaussian universality (or lack thereof) crucially depend on the alignment between the target weights and the means and covariances of the mixture clusters, which we precisely quantify. In the particular case of least-squares interpolation, we prove a strong universality property of the training error and show it follows a simple, closed-form expression. Finally, we apply our results to real datasets, clarifying some recent discussions in the literature about Gaussian universality of the errors in this context.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

gaussian_icml.pdf

Type

Main Document

Version

Published version

Access type

openaccess

License Condition

Creative Commons Attribution 4.0 International

Size

3.15 MB

Format

Adobe PDF

Checksum (MD5)

1a0d160f219acc949f2572d436cd3283

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés