Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition
 
conference paper

PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition

Bergsma, Boris  
•
Yang, Minhao
•
Cernak, Milos
January 1, 2022
Interspeech 2022
Interspeech Conference

At the end of Moore's law, new computing paradigms are required to prolong the battery life of wearable and IoT smart audio devices. Theoretical analysis and physical validation have shown that analog signal processing (ASP) can be more power-efficient than its digital counterpart in the realm of lowto-medium signal-to-noise ratio applications. In addition, ASP allows a direct interface with an analog microphone without a power-hungry analog-to-digital converter. Here, we present power-efficient analog acoustic features (PEAF) that are validated by fabricated CMOS chips for running audio recognition. Linear, non-linear, and learnable PEAF variants are evaluated on two speech processing tasks that are demanded in many battery-operated devices: wake word detection (WWD) and keyword spotting (KWS). Compared to digital acoustic features, higher power efficiency with competitive classification accuracy can be obtained. A novel theoretical framework based on information theory is established to analyze the information flow in each individual stage of the feature extraction pipeline. The analysis identifies the information bottleneck and helps improve the KWS accuracy by up to 7%. This work may pave the way to building more power-efficient smart audio devices with best-in-class inference performance.

  • Details
  • Metrics
Type
conference paper
DOI
10.21437/Interspeech.2022-10412
Web of Science ID

WOS:000900724500077

Author(s)
Bergsma, Boris  
Yang, Minhao
Cernak, Milos
Date Issued

2022-01-01

Publisher

ISCA-INT SPEECH COMMUNICATION ASSOC

Publisher place

Baixas

Published in
Interspeech 2022
Series title/Series vol.

Interspeech

Start page

381

End page

385

Subjects

Acoustics

•

Audiology & Speech-Language Pathology

•

Computer Science, Artificial Intelligence

•

Engineering, Electrical & Electronic

•

Acoustics

•

Audiology & Speech-Language Pathology

•

Computer Science

•

Engineering

•

analog systems

•

audio classification

•

power efficiency

•

information theory

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
C3MP  
Event nameEvent placeEvent date
Interspeech Conference

Incheon, SOUTH KOREA

Sep 18-22, 2022

Available on Infoscience
March 27, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/196465
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés