U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search

Yuzuguler, Ahmet Caner; Dimitriadis, Nikolaos; Frossard, Pascal

doi:10.1007/978-3-031-19775-8_11

conference paper

January 1, 2022

Computer Vision, ECCV 2022, Pt XII

17th European Conference on Computer Vision (ECCV)

Optimizing resource utilization on target platforms is key to achieving high performance during DNN inference. While optimizations have been proposed for inference latency, memory footprint, and energy consumption, prior hardware-aware neural architecture search (NAS) methods have omitted resource utilization, preventing DNNs from taking full advantage of the target inference platforms. Modeling resource utilization efficiently and accurately is challenging, especially for widely used array-based inference accelerators such as the Google TPU. In this work, we propose a novel hardware-aware NAS framework that optimizes not only for task accuracy and inference latency but also for resource utilization. We also propose and validate a new computational model for resource utilization in inference accelerators. Using the proposed NAS framework and resource utilization model, we achieve a 2.8x to 4x speedup for DNN inference compared to prior hardware-aware NAS methods while attaining similar or improved accuracy in image classification on the CIFAR-10 and ImageNet-100 datasets. (Source code is available at https://github.com/yuezuegu/LBoostNAS).
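To make the notion of resource utilization on an array-based accelerator more concrete, the following is a minimal sketch under a simplified tiling assumption. It is not the computational model proposed in the paper. The function name `array_utilization` and its parameters are hypothetical, and the sketch assumes that a layer's weight matrix is split into tiles the size of the processing-element array, with partially filled tiles leaving some elements idle.

```python
import math


def array_utilization(rows: int, cols: int, array_rows: int, array_cols: int) -> float:
    """Fraction of processing elements doing useful work (illustrative only).

    Assumes a rows x cols weight matrix is tiled onto an
    array_rows x array_cols grid of processing elements, and that every
    tile occupies the whole array even when it is only partially filled.
    """
    if min(rows, cols, array_rows, array_cols) <= 0:
        raise ValueError("all dimensions must be positive")

    # Number of tiles needed along each dimension (partial tiles round up).
    tiles_r = math.ceil(rows / array_rows)
    tiles_c = math.ceil(cols / array_cols)

    # Processing-element slots reserved across all tiles.
    reserved = tiles_r * array_rows * tiles_c * array_cols

    # Useful work is one slot per weight; the remainder is idle padding.
    return (rows * cols) / reserved


if __name__ == "__main__":
    # A layer that matches the array exactly uses every element.
    print(array_utilization(128, 128, 128, 128))
    # One extra row forces a second, almost empty, row of tiles.
    print(array_utilization(129, 128, 128, 128))
```

Under this assumption, a layer whose dimensions slightly exceed a multiple of the array size wastes close to half of the reserved elements, which illustrates why an architecture search that ignores utilization can select layers that map poorly onto the hardware.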

Type

conference paper

DOI

10.1007/978-3-031-19775-8_11

Web of Science ID

WOS:000897093900011

Authors

Yuzuguler, Ahmet Caner

Dimitriadis, Nikolaos

Frossard, Pascal

Publication date

2022-01-01

Publisher

Springer International Publishing AG

Published in

Computer Vision, ECCV 2022, Pt XII

ISBN of the book

978-3-031-19774-1

978-3-031-19775-8

Publisher place

Cham

Series title/Series vol.

Lecture Notes in Computer Science

Volume

13672

Start page

173

End page

190

Subjects

Computer Science, Art...

Imaging Science & Pho...

Computer Science

hardware-aware neural...

dnn inference

hardware accelerator

resource utilization

algorithms

Peer reviewed

REVIEWED

EPFL units

LTS4

Event name	Event place	Event date
17th European Conference on Computer Vision (ECCV)	Tel Aviv, Israel	Oct 23-27, 2022

Available on Infoscience

January 30, 2023

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/194363