Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Speech Recognition Using Advanced HMM2 Features
 
Loading...
Thumbnail Image
conference paper

Speech Recognition Using Advanced HMM2 Features

Weber, Katrin
•
Bengio, Samy  
•
Bourlard, Hervé  
2001
Automatic Speech Recognition and Understanding Workshop
Automatic Speech Recognition and Understanding Workshop

HMM2 is a particular hidden Markov model where state emission probabilities of the temporal (primary) HMM are modeled through (secondary) state-dependent frequency-based HMMs [12]. As shown in [13], a secondary HMM can also be used to extract robust ASR features. Here, we further investigate this novel approach towards using a full HMM2 as feature extractor, working in the spectral domain, and extracting robust formant-like features for standard ASR system. HMM2 performs a nonlinear, state-dependent frequency warping, and it is shown that the resulting frequency segmentation actually contains particularly discriminant features. To further improve the HMM2 system, we complement the initial spectral energy vectors with frequency information. Finally, adding temporal information to the HMM2 feature vector yields further improvements. These conclusions are experimentally validated on the Numbers95 database, where word error rates of 15%, using only a 4-dimensional feature vector (3 formant-like parameters and one time index) were obtained.

  • Files
  • Details
  • Metrics
Type
conference paper
Author(s)
Weber, Katrin
•
Bengio, Samy  
•
Bourlard, Hervé  
Date Issued

2001

Publisher place

Madonna di Campiglio, Italy

Published in
Automatic Speech Recognition and Understanding Workshop
Subjects

speech

•

weber

•

bengio

•

bourlard

Note

IDIAP-rr 01-24

URL

URL

http://publications.idiap.ch/downloads/reports/2001/rr01-24.pdf

Related documents

http://publications.idiap.ch/index.php/publications/showcite/weber-rr-01-24
Written at

EPFL

EPFL units
LIDIAP  
Event name
Automatic Speech Recognition and Understanding Workshop
Available on Infoscience
March 10, 2006
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/228066
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés