The committee machine: computational to statistical gaps in learning a two-layers neural network

Aubin, Benjamin; Maillard, Antoine; Barbier, Jean; Krzakala, Florent; Macris, Nicolas; Zdeborova, Lenka

doi:10.1088/1742-5468/ab43d2

Aubin, Benjamin; Maillard, Antoine; Barbier, Jean; Krzakala, Florent; Macris, Nicolas; Zdeborova, Lenka

2019

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Heuristic tools from statistical physics have been used in the past to locate the phase transitions and compute the optimal learning and generalization errors in the teacher-student scenario in multi-layer neural networks. In this paper, we provide a rigorous justification of these approaches for a two-layers neural network model called the committee machine, under a technical assumption. We also introduce a version of the approximate message passing (AMP) algorithm for the committee machine that allows optimal learning in polynomial time for a large set of parameters. We find that there are regimes in which a low generalization error is information-theoretically achievable while the AMP algorithm fails to deliver it; strongly suggesting that no efficient algorithm exists for those cases, unveiling a large computational gap.

Details

Title The committee machine: computational to statistical gaps in learning a two-layers neural network

Author(s) Aubin, Benjamin ; Maillard, Antoine ; Barbier, Jean ; Krzakala, Florent ; Macris, Nicolas ; Zdeborova, Lenka

Published in Journal Of Statistical Mechanics-Theory And Experiment

Volume 2019

Issue 12

Pages 124023

Date 2019-12-01

Publisher Bristol, IOP PUBLISHING LTD

ISSN 1742-5468

Keywords

machine learning; phase-transitions; space

DOI https://doi.org/10.1088/1742-5468/ab43d2

Other identifier(s) View record in Web of Science

Laboratories LTHC

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LTHC - Communication Theories Laboratory
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2020-02-26

Abstract

Details

Actions