Using entropy as a stream reliability estimate for audio-visual speech recognition

Gurban, Mihai; Thiran, Jean-Philippe

2008

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We present a method for dynamically integrating audio-visual information for speech recognition, based on the estimated reliability of the audio and visual streams. Our method uses an information theoretic measure, the entropy derived from the state probability distribution for each stream, as an estimate of reliability. The two modalities, audio and video, are weighted at each time instant according to their reliability. In this way, the weights vary dynamically and are able to adapt to any type of noise in each modality, and more importantly, to unexpected variations in the level of noise.

Details

Title Using entropy as a stream reliability estimate for audio-visual speech recognition

Author(s) Gurban, Mihai ; Thiran, Jean-Philippe

Published in 16th European Signal Processing Conference

Conference 16th European Signal Processing Conference, Lausanne, Switzerland, August 25-29, 2008

Date 2008

Publisher Lausanne, Switzerland

Keywords

LTS5

Additional link URL

Laboratories LTS5

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LTS5 - Signal Processing Laboratory 5
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2008-06-09

Actions

Preview

Select file: