Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition

Stephenson, Todd Andrew; Magimai.-Doss, Mathew; Bourlard, Hervé

doi:10.1109/ICPR.2002.1047454

Stephenson, Todd Andrew; Magimai.-Doss, Mathew; Bourlard, Hervé

2002

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Standard hidden Markov models (HMMs), as used in automatic speech recognition (ASR), calculate their emission probabilities by an artificial neural network (ANN) or a Gaussian distribution conditioned on the hidden state variable, considering the emissions independent of any other variable in the model. Recent work showed the benefit of conditioning the emission distributions on a discrete auxiliary variable, which is observed in training and hidden in recognition. Related work has shown the utility of conditioning the emission distributions on a continuous auxiliary variable. We apply mixed Bayesian networks (BNs) to extend these works by introducing a continuous auxiliary variable that is observed in training but is hidden in recognition. We find that an auxiliary pitch variable conditioned itself upon the hidden state can degrade performance unless the auxiliary variable is also hidden. The performance, furthermore, can be improved by making the auxiliary pitch variable independent of the hidden state.

Details

Title Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition

Author(s) Stephenson, Todd Andrew ; Magimai.-Doss, Mathew ; Bourlard, Hervé

Published in International Conference on Pattern Recognition (ICPR 2002)

Volume 4

Pages 293-296

Conference International Conference on Pattern Recognition (ICPR~2002), Quebec City, PQ, Canada

Date 2002

Keywords

stephenson; speech

DOI https://doi.org/10.1109/ICPR.2002.1047454

Additional link URL; Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2006-03-10

Files

Abstract

Details

PDF