Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. An Accuracy-Driven Compression Methodology to Derive Efficient Codebook-Based CNNs
 
conference paper not in proceedings

An Accuracy-Driven Compression Methodology to Derive Efficient Codebook-Based CNNs

Ponzina, Flavio  
•
Peon Quiros, Miguel  
•
Ansaloni, Giovanni  
Show more
2022
IEEE International Conference on Omni-Layer Intelligent Systems (COINS)

Codebook-based optimizations are a class of algorithmic-level transformations able to effectively reduce the computing and memory requirements of Convolutional Neural Networks (CNNs). This approach tightly limits the number of unique weights in each layer, allowing the storage of employed values in codebooks containing a small number of floating-point entries. Then, CNN models are represented as low-bitwidth indexes of such codebooks. This work introduces a novel iterative methodology to find highly beneficial schemes trading off accuracy and model compression in codebook-based CNNs. Our strategy can retrieve non-uniform solutions driven by an accuracy constraint embedded in the optimization loop. Our results indicate that, for a 1% accuracy degradation, our methodology can compress baseline floating-point CNN models up to 19x. Moreover, by reducing the number of memory accesses, our strategy increases energy efficiency and improves inference performance by up to 91%.

  • Files
  • Details
  • Metrics
Type
conference paper not in proceedings
DOI
10.1109/COINS54846.2022.9854986
Author(s)
Ponzina, Flavio  
Peon Quiros, Miguel  
Ansaloni, Giovanni  
Atienza Alonso, David  
Date Issued

2022

Total of pages

6

Subjects

CNN compression

•

Clustering

•

Ensembling

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
ESL  
Event nameEvent placeEvent date
IEEE International Conference on Omni-Layer Intelligent Systems (COINS)

Barcelona, Spain

August 1-3, 2022

Available on Infoscience
June 8, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/188435
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés