conference paper

Low-Level Physiological Implications of End-to-End Learning of Speech Recognition

de Gibson, Louise Coppieters
•
Garner, Philip N.  
January 1, 2022
Interspeech 2022
Interspeech Conference

Current speech recognition architectures perform very well from the point of view of machine learning, hence user interaction. This suggests that they are emulating the human biological system well. We investigate whether the inference can be inverted to provide insights into that biological system; in particular the hearing mechanism. Using SincNet, we confirm that end-to-end systems do learn well-known filterbank structures. However, we also show that wider-bandwidth filters are important in the learned structure. Whilst some benefits can be gained by initialising both narrow and wide-band filters, physiological constraints suggest that such filters arise in mid-brain rather than the cochlea. We show that standard machine learning architectures must be modified to allow this process to be emulated neurally.
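
For readers unfamiliar with SincNet, the following is a minimal sketch of the kind of filter it learns: each first-layer filter is a band-pass kernel defined by just two cutoff frequencies, built as the difference of two windowed sinc low-pass filters. It is not the authors' code. The mel-spaced initialisation, the 16 kHz sample rate, the 40 filters, the kernel length of 251 and all function names are illustrative assumptions.

```python
import numpy as np

def hz_to_mel(f_hz):
    # Common mel-scale approximation (assumed here for initialisation).
    return 2595.0 * np.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_band_edges(n_filters, f_min, f_max):
    """Lower and upper cutoffs (Hz) of adjacent bands equally spaced in mel."""
    mels = np.linspace(hz_to_mel(f_min), hz_to_mel(f_max), n_filters + 1)
    edges = mel_to_hz(mels)
    return edges[:-1], edges[1:]

def sinc_bandpass(f_low, f_high, sample_rate, kernel_size=251):
    """Band-pass kernel as the difference of two sinc low-pass filters."""
    n = np.arange(kernel_size) - (kernel_size - 1) / 2  # centred time index
    f1 = f_low / sample_rate   # cutoffs in cycles per sample
    f2 = f_high / sample_rate
    # np.sinc is the normalised sinc, sin(pi x) / (pi x).
    h = 2 * f2 * np.sinc(2 * f2 * n) - 2 * f1 * np.sinc(2 * f1 * n)
    return h * np.hamming(kernel_size)  # window to reduce truncation ripple

# Hypothetical configuration: 40 filters between 30 Hz and 8 kHz at 16 kHz.
low, high = mel_band_edges(n_filters=40, f_min=30.0, f_max=8000.0)
bank = np.stack([sinc_bandpass(lo, hi, 16000) for lo, hi in zip(low, high)])
```

In a trainable model the two cutoffs per filter would be the learned parameters. A wider-bandwidth filter in this sketch is simply a call to `sinc_bandpass` with cutoffs that are further apart.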

Type
conference paper
DOI
10.21437/Interspeech.2022-10093
Web of Science ID

WOS:000900724500091

Author(s)
de Gibson, Louise Coppieters
Garner, Philip N.  
Date Issued

2022-01-01

Publisher

ISCA (International Speech Communication Association)

Publisher place

Baixas

Published in
Interspeech 2022
Series title/Series vol.

Interspeech

Start page

749

End page

753

Subjects

Acoustics • Audiology & Speech-Language Pathology • Computer Science, Artificial Intelligence • Engineering, Electrical & Electronic • Computer Science • Engineering • speech recognition • cochlear models • end-to-end architectures • filterbanks • sincnet

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event name

Interspeech Conference

Event place

Incheon, South Korea

Event date

Sep 18-22, 2022

Available on Infoscience
March 27, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/196429