Design Patterns for Resource-Constrained Automated Deep-Learning Methods

Tuggener, Lukas; Amirian, Mohammadreza; Benites, Fernando; von Däniken, Pius; Gupta, Prakhar; Schilling, Frank-Peter; Stadelmann, Thilo

doi:10.3390/ai1040031

Tuggener, Lukas; Amirian, Mohammadreza; Benites, Fernando; von Däniken, Pius; Gupta, Prakhar; Schilling, Frank-Peter; Stadelmann, Thilo

2020

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Résumé

We present an extensive evaluation of a wide variety of promising design patterns for automated deep-learning (AutoDL) methods, organized according to the problem categories of the 2019 AutoDL challenges, which set the task of optimizing both model accuracy and search efficiency under tight time and computing constraints. We propose structured empirical evaluations as the most promising avenue to obtain design principles for deep-learning systems due to the absence of strong theoretical support. From these evaluations, we distill relevant patterns which give rise to neural network design recommendations. In particular, we establish (a) that very wide fully connected layers learn meaningful features faster; we illustrate (b) how the lack of pretraining in audio processing can be compensated by architecture search; we show (c) that in text processing deep-learning-based methods only pull ahead of traditional methods for short text lengths with less than a thousand characters under tight resource limitations; and lastly we present (d) evidence that in very data- and computing-constrained settings, hyperparameter tuning of more traditional machine-learning methods outperforms deep-learning systems.

Détails

Titre Design Patterns for Resource-Constrained Automated Deep-Learning Methods

Auteur(s) Tuggener, Lukas ; Amirian, Mohammadreza ; Benites, Fernando ; von Däniken, Pius ; Gupta, Prakhar ; Schilling, Frank-Peter ; Stadelmann, Thilo

Publié dans AI

Volume 1

Numéro 4

Pages 510-538

Date 2020-10-06

Mots-clés (libres)

automated machine learning; architecture design; computer vision; audio processing; natural language processing; weakly supervised learning

Note This is an Open Access article under the terms of the Creative Commons Attribution License

DOI https://doi.org/10.3390/ai1040031

Autres identifiant(s) DOI: https://doi.org/10.3390/ai1040031

Laboratoires MLO

Le document apparaît dans Production scientifique et compétences > I&C - Faculté Informatique & Communications > IINFCOM > MLO - Laboratoire d'apprentissage automatique et d'optimisation
Publications validées par des pairs
Travail produit à l'EPFL
Articles de journaux
Publié

Date de création de la notice 2021-01-07

Files

Résumé

Détails

PDF