Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Robust Speech Recognition and Feature Extraction Using HMM2
 
research article

Robust Speech Recognition and Feature Extraction Using HMM2

Weber, Katrin
•
Ikbal, Shajith
•
Bengio, Samy  
Show more
2003
Computer Speech & Language

This paper presents the theoretical basis and preliminary experimental results of a new HMM model, referred to as HMM2, which can be considered as a mixture of HMMs. In this new model, the emission probabilities of the temporal (primary) HMM are estimated through secondary, state specific, HMMs working in the acoustic feature space. Thus, while the primary HMM is performing the usual time warping and integration, the secondary HMMs are responsible for extracting/modeling the possible feature dependencies, while performing frequency warping and integration. Such a model has several potential advantages, such as a more flexible modeling of the time/frequency structure of the speech signal. When working with spectral features, such a system can also perform nonlinear spectral warping, effectively implementing a form of nonlinear vocal tract normalization. Furthermore, it will be shown that HMM2 can be used to extract noise robust features, supposed to correspond to formant regions, which can be used as extra features for traditional HMM recognizers to improve their performance. These issues are evaluated in the present paper, and different experimental results are reported on the Numbers95 database.

  • Details
  • Metrics
Type
research article
DOI
10.1016/S0885-2308(03)00012-3
Web of Science ID

WOS:000183091500006

Author(s)
Weber, Katrin
Ikbal, Shajith
Bengio, Samy  
Bourlard, Hervé  
Date Issued

2003

Published in
Computer Speech & Language
Volume

17

Issue

2-3

Start page

195

End page

211

Subjects

speech

URL

Related documents

http://publications.idiap.ch/index.php/publications/showcite/weber-rr-01-42
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Available on Infoscience
March 10, 2006
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/228276
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés