Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition
 
conference paper

Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition

Dighe, Pranay
•
Luyet, Gil
•
Asaei, Afsaneh  
Show more
2016
2016 Ieee International Conference On Acoustics, Speech And Signal Processing Proceedings
Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016)

We propose to model the acoustic space of deep neural network (DNN) class-conditional posterior probabilities as a union of low- dimensional subspaces. To that end, the training posteriors are used for dictionary learning and sparse coding. Sparse representation of the test posteriors using this dictionary enables projection to the space of training data. Relying on the fact that the intrinsic di- mensions of the posterior subspaces are indeed very small and the matrix of all posteriors belonging to a class has a very low rank, we demonstrate how low-dimensional structures enable further en- hancement of the posteriors and rectify the spurious errors due to mismatch conditions. The enhanced acoustic modeling method leads to improvements in continuous speech recognition task using hybrid DNN-HMM (hidden Markov model) framework in both clean and noisy conditions, where upto 15.4% relative reduction in word error rate (WER) is achieved.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2016.7472767
Web of Science ID

WOS:000388373405168

Author(s)
Dighe, Pranay
Luyet, Gil
Asaei, Afsaneh  
Bourlard, Hervé  
Date Issued

2016

Publisher

IEEE

Publisher place

New York

Published in
2016 Ieee International Conference On Acoustics, Speech And Signal Processing Proceedings
ISBN of the book

978-1-4799-9988-0

Total of pages

5

Start page

5690

End page

5694

Subjects

Sparse coding

•

Dictionary learning

•

Deep neural network

•

Union of Low Dimensional Subspaces

•

Acoustic modeling

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent place
Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016)

Shanghai

Available on Infoscience
April 19, 2016
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/125782
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés