Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. A Bin Encoding Training Of A Spiking Neural Network Based Voice Activity Detection
 
conference paper

A Bin Encoding Training Of A Spiking Neural Network Based Voice Activity Detection

Dellaferrera, Giorgia
•
Martinelli, Flavio  
•
Cernak, Milos
January 1, 2020
2020 Ieee International Conference On Acoustics, Speech, And Signal Processing
IEEE International Conference on Acoustics, Speech, and Signal Processing

Advances of deep learning for Artificial Neural Networks (ANNs) have led to significant improvements in the performance of digital signal processing systems implemented on digital chips. Although recent progress in low-power chips is remarkable, neuromorphic chips that run Spiking Neural Networks (SNNs) based applications offer an even lower power consumption, as a consequence of the ensuing sparse spike-based coding scheme. In this work, we develop a SNN-based Voice Activity Detection (VAD) system that belongs to the building blocks of any audio and speech processing system. We propose to use the bin encoding, a novel method to convert log mel filterbank bins of single-time frames into spike patterns. We integrate the proposed scheme in a bilayer spiking architecture which was evaluated on the QUT-NOISE-TIMIT corpus. Our approach shows that SNNs enable an ultra low-power implementation of a VAD classifier that consumes only 3.8 mu W, while achieving state-of-the-art performance. The code is freely available on Code Ocean [1].

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP40776.2020.9054761
Web of Science ID

WOS:000615970403091

Author(s)
Dellaferrera, Giorgia
Martinelli, Flavio  
Cernak, Milos
Date Issued

2020-01-01

Publisher

IEEE

Publisher place

New York

Published in
2020 Ieee International Conference On Acoustics, Speech, And Signal Processing
ISBN of the book

978-1-5090-6631-5

Series title/Series vol.

International Conference on Acoustics Speech and Signal Processing ICASSP

Start page

3207

End page

3211

Subjects

Acoustics

•

Engineering, Electrical & Electronic

•

Engineering

•

spiking neural networks

•

voice activity detection

•

bin encoding

•

supervised learning

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LCN  
Event nameEvent placeEvent date
IEEE International Conference on Acoustics, Speech, and Signal Processing

Barcelona, SPAIN

May 04-08, 2020

Available on Infoscience
March 26, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/176280
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés