Audio-Visual Speech Recognition with a Hybrid SVM-HMM System

Gurban, M.; Thiran, J.

Gurban, M.; Thiran, J.

2005

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Traditional speech recognition systems use Gaussian mixture models to obtain the likelihoods of individual phonemes, which are then used as state emission probabilities in hidden Markov models representing the words. In hybrid systems, the Gaussian mixtures are replaced by more discriminant classifiers, leading to an improved performance. Most of the time the classifiers used in such systems are neural networks. Support vector machines have also been used in one-modality audio or visual speech recognition, but never in a multimodal audio-visual system. We propose such a hybrid SVM-HMM speech recognizer, and we show how the multimodal approach leads to better performance than that obtained with any of the two modalities individually.

Details

Title Audio-Visual Speech Recognition with a Hybrid SVM-HMM System

Author(s) Gurban, M. ; Thiran, J.

Published in 13th European Signal Processing Conference (EUSIPCO)

Date 2005

Publisher EUSIPCO

Keywords

LTS5

Laboratories LTS5

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LTS5 - Signal Processing Laboratory 5
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2006-06-14

Actions

Preview

Select file: