A Bin Encoding Training Of A Spiking Neural Network Based Voice Activity Detection

Dellaferrera, Giorgia; Martinelli, Flavio; Cernak, Milos

doi:10.1109/ICASSP40776.2020.9054761

Dellaferrera, Giorgia; Martinelli, Flavio; Cernak, Milos

2020

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Advances of deep learning for Artificial Neural Networks (ANNs) have led to significant improvements in the performance of digital signal processing systems implemented on digital chips. Although recent progress in low-power chips is remarkable, neuromorphic chips that run Spiking Neural Networks (SNNs) based applications offer an even lower power consumption, as a consequence of the ensuing sparse spike-based coding scheme. In this work, we develop a SNN-based Voice Activity Detection (VAD) system that belongs to the building blocks of any audio and speech processing system. We propose to use the bin encoding, a novel method to convert log mel filterbank bins of single-time frames into spike patterns. We integrate the proposed scheme in a bilayer spiking architecture which was evaluated on the QUT-NOISE-TIMIT corpus. Our approach shows that SNNs enable an ultra low-power implementation of a VAD classifier that consumes only 3.8 mu W, while achieving state-of-the-art performance. The code is freely available on Code Ocean [1].

Details

Title A Bin Encoding Training Of A Spiking Neural Network Based Voice Activity Detection

Author(s) Dellaferrera, Giorgia ; Martinelli, Flavio ; Cernak, Milos

Published in 2020 Ieee International Conference On Acoustics, Speech, And Signal Processing

Series International Conference on Acoustics Speech and Signal Processing ICASSP

Pages 3207-3211

Conference IEEE International Conference on Acoustics, Speech, and Signal Processing, Barcelona, SPAIN, May 04-08, 2020

Date 2020-01-01

Publisher New York, IEEE

ISSN 1520-6149

ISBN 978-1-5090-6631-5

Keywords

spiking neural networks; voice activity detection; bin encoding; supervised learning

DOI https://doi.org/10.1109/ICASSP40776.2020.9054761

Other identifier(s) View record in Web of Science

Laboratories LCN

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LCN - Computational Neuroscience Laboratory
Scientific production and competences > SV - School of Life Sciences > BMI - Brain Mind Institute > LCN - Computational Neuroscience Laboratory
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2021-03-26