Low cost duration modelling for noise robust speech recognition

Morris, Andrew; Payne, Simon; Bourlard, Hervé

Morris, Andrew; Payne, Simon; Bourlard, Hervé

2002

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Fichiers

Résumé

State transition matrices as used in standard HMM decoders have two widely perceived limitations. One is that the implicit Geometric state duration distributions which they model do not accurately reflect true duration distributions. The other is that they impose no hard limit on maximum duration with the result that state transition probabilities often have little influence when combined with acoustic probabilities, which are of a different order of magnitude. Explicit duration models were developed in the past to address the first problem. These were not widely taken up because their performance advantage in clean speech recognition was often not sufficiently great to offset the extra complexity which they introduced. However, duration models have much greater potential when applied to noisy speech recognition. In this paper we present a simple and generic form of explicit duration model and show that this leads to strong performance improvements when applied to connected digit recognition in noise.

Détails

Titre Low cost duration modelling for noise robust speech recognition

Auteur(s) Morris, Andrew ; Payne, Simon ; Bourlard, Hervé

Date 2002

Editeur IDIAP

Mots-clés (libres)

duration models; speech; HMMs; noise robust ASR

Lien supplémentaire URL

Laboratoires LIDIAP

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Production scientifique et compétences > Euler Center for Signal Processing
Travail produit à l'EPFL
Rapports techniques
Publié

Date de création de la notice 2006-03-10

Actions

Aperçu

Sélectionner le fichier :