Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications
 
conference paper

Hierarchical and Parallel Processing of Modulation Spectrum for ASR applications

Valente, Fabio
•
Hermansky, Hynek  
2008
2008 IEEE International Conference on Acoustics, Speech and Signal Processing
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

The modulation spectrum is an efficient representation for describing dynamic information in signals. In this work we investigate how to exploit different elements of the modulation spectrum for extraction of information in automatic recognition of speech (ASR). Parallel and hierarchical (sequential) approaches are investigated. Parallel processing combines outputs of independent classifiers applied to different modulation frequency channels. Hierarchical processing uses different modulation frequency channels sequentially. Experiments are run on a LVCSR task for meetings transcription and results are reported on the RT05 evaluation data. Processing modulation frequencies channels with different classifiers provides a consistent reduction in WER (2% absolute w.r.t. PLP baseline). Hierarchical processing outperforms parallel processing. The largest WER reduction is obtained trough sequential processing moving from high to low modulation frequencies. This model is consistent with several perceptual and physiological studies on auditory processing.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2008.4518572
Author(s)
Valente, Fabio
Hermansky, Hynek  
Date Issued

2008

Published in
2008 IEEE International Conference on Acoustics, Speech and Signal Processing
Start page

4165

End page

4168

Note

IDIAP-RR 07-45

URL

URL

http://publications.idiap.ch/downloads/papers/2008/valente-ICASSP-2008.pdf

Related documents

http://publications.idiap.ch/index.php/publications/showcite/valente:rr07-45
Written at

EPFL

EPFL units
LIDIAP  
Event name
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Available on Infoscience
February 11, 2010
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/47163
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés