An Analysis of Super-Net Heuristics in Weight-Sharing NAS

Yu, Kaicheng; Ranftl, Rene; Salzmann, Mathieu

doi:10.1109/TPAMI.2021.3108480

2022

Abstract

Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware. Existing methods in this space rely on a diverse set of heuristics to design and train the shared-weight backbone network, a.k.a. the super-net. Because these heuristics vary substantially across methods and have not been carefully studied, it is unclear to what extent they affect super-net training and hence the weight-sharing NAS algorithms. In this paper, we disentangle super-net training from the search algorithm, isolate 14 frequently used training heuristics, and evaluate them over three benchmark search spaces. Our analysis uncovers that several commonly used heuristics negatively impact the correlation between super-net and stand-alone performance, whereas simple but often overlooked factors, such as proper hyper-parameter settings, are key to achieving strong performance. Equipped with this knowledge, we show that simple random search performs competitively with complex state-of-the-art NAS algorithms when the super-net is properly trained.
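The "correlation between super-net and stand-alone performance" mentioned in the abstract can be illustrated with a rank correlation over a handful of architectures. The abstract does not say which measure is used, so the sketch below assumes Kendall's tau (the tau-a variant, without tie correction) as one common choice. The function name and the accuracy values are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs.

    Assumption: tau-a is used for simplicity; tied pairs count as neither
    concordant nor discordant.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical accuracies for five architectures (illustrative values only):
# one list evaluated with shared super-net weights, one trained stand-alone.
supernet_acc = [0.61, 0.58, 0.66, 0.55, 0.63]
standalone_acc = [0.935, 0.941, 0.947, 0.928, 0.939]

tau = kendall_tau(supernet_acc, standalone_acc)
print(f"Kendall tau = {tau:.2f}")
```

A value near 1 would mean the super-net ranks architectures in nearly the same order as stand-alone training does, which is the property a search algorithm (including random search) relies on when it picks architectures from super-net scores.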

Details

Title An Analysis of Super-Net Heuristics in Weight-Sharing NAS

Author(s) Yu, Kaicheng ; Ranftl, Rene ; Salzmann, Mathieu

Published in IEEE Transactions on Pattern Analysis and Machine Intelligence

Volume 44

Issue 11

Pages 8110-8124

Date 2022-11-01

Publisher Los Alamitos, IEEE Computer Society

ISSN 0162-8828; 1939-3539

Keywords

training; protocols; computer architecture; task analysis; measurement; benchmark testing; encoding; automl; neural architecture search; weight-sharing; super-net

DOI https://doi.org/10.1109/TPAMI.2021.3108480

Laboratories CVLAB

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > CVLAB - Computer Vision Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2022-11-07
