Abstract

We investigate how the training curve of isotropic kernel methods depends on the symmetry of the task to be learned, in several settings. (i) We consider a regression task, where the target function is a Gaussian random field that depends only on $d_\parallel$ variables, fewer than the input dimension $d$. We compute the expected test error $\epsilon$, which follows $\epsilon \sim p^{-\beta}$, where $p$ is the size of the training set. We find that $\beta \sim 1/d$ independently of $d_\parallel$, supporting previous findings that the presence of invariants does not resolve the curse of dimensionality for kernel regression. (ii) Next we consider support-vector binary classification and introduce the stripe model, where the data label depends on a single coordinate, $y(\underline{x}) = y(x_1)$, corresponding to parallel decision boundaries separating labels of different signs, with no margin at these interfaces. We argue and confirm numerically that, for large bandwidth, $\beta = \frac{d-1+\xi}{3d-3+\xi}$, where $\xi \in (0,2)$ is the exponent characterizing the singularity of the kernel at the origin. This estimate improves on the classical bounds obtainable from Rademacher complexity. In this setting there is no curse of dimensionality, since $\beta \to 1/3$ as $d \to \infty$. (iii) We confirm these findings for the spherical model, for which $y(\underline{x}) = y(\|\underline{x}\|)$. (iv) In the stripe model, we show that, if the data are compressed along their invariant directions by some factor $\lambda$ (an operation believed to take place in deep networks), the test error is reduced by a factor $\lambda^{-\frac{2(d-1)}{3d-3+\xi}}$.
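
The following is a minimal numerical sketch, not taken from the paper, that simply evaluates the two closed-form expressions quoted above: the large-bandwidth exponent $\beta = \frac{d-1+\xi}{3d-3+\xi}$ and the compression gain $\lambda^{-\frac{2(d-1)}{3d-3+\xi}}$. The helper names `beta` and `compression_gain`, and the choices of $d$, $\xi$, and $\lambda$, are illustrative assumptions.

```python
def beta(d: int, xi: float) -> float:
    """Predicted stripe-model learning-curve exponent, epsilon ~ p^{-beta}."""
    return (d - 1 + xi) / (3 * d - 3 + xi)


def compression_gain(d: int, xi: float, lam: float) -> float:
    """Factor by which the test error is reduced when data are compressed
    along their invariant directions by a factor lam (lam > 1)."""
    return lam ** (-2 * (d - 1) / (3 * d - 3 + xi))


if __name__ == "__main__":
    xi = 1.0  # illustrative singularity exponent, xi in (0, 2)
    lam = 2.0  # illustrative compression factor
    for d in (2, 5, 10, 100):
        print(f"d={d:4d}  beta={beta(d, xi):.3f}  "
              f"gain(lam={lam})={compression_gain(d, xi, lam):.3f}")
    # As d grows, beta approaches 1/3, illustrating the absence of a
    # curse of dimensionality in this classification setting.
```
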
