Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR

Hagen, Astrid; Morris, Andrew; Bourlard, Hervé

Hagen, Astrid; Morris, Andrew; Bourlard, Hervé

1999

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Résumé

In this paper, we present and investigate a new method for subband-based Automatic Speech Recognition (ASR) which approximates the ideal `full combination' approach which is itself often not practical to realize. The `full combination' approach consists of explicitly considering all possible combinations of subbands (\cite{Hermansky96:TAO}) avoiding the usually necessary independence assumption, which would limit the potential of subband-based ASR. We show how this ideal approach can be effectuated by a nonlinear combination function which constitutes the fullband posterior probabilities decomposed into a weighted sum of posterior probabilities from Artificial Neural Network (ANN) experts. This involves training of one expert for each possible subband combination. To limit such extensive training, we have found that it is possible to achieve comparable results by estimating the subband posterios for each combinationas a function of the posteriors from the individual subbands alone (\cite{Hagen98:SBS,Morris99:TFC}). The theoretical foundation of our solution to the ideal `full combination' approach with the nonlinear combination function and its approximation are presented. The weights,which represent the relative utility for recognition of each subband combination, are very important for this technique and possible schemes for their estimation will be proposed. They have been tested and compared in the framework of HMM/ANN-Hybrid systems on clean and noise-added data.

Détails

Titre Different Weighting Schemes in the Full Combination Subbands Approach for Noise Robust ASR

Auteur(s) Hagen, Astrid ; Morris, Andrew ; Bourlard, Hervé

Publié dans Robust Methods for Speech Recognition in Adverse Conditions

Présenté à Robust Methods for Speech Recognition in Adverse Conditions

Date 1999

Editeur Tampere, Finland

Mots-clés (libres)

subbands; noise; hagen; multiband; weighting; morris; bourlard; speech; Noise; HMM/ANN-Hybrid

Note IDIAP-RR 99-11

Lien supplémentaire URL

Laboratoires LIDIAP

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Production scientifique et compétences > Euler Center for Signal Processing
Papiers de conférence
Travail produit à l'EPFL
Publié

Date de création de la notice 2006-03-10

Files

Résumé

Détails

PDF