Dynamic modality weighting for multi-stream HMMs in Audio-Visual Speech Recognition
Merging decisions from different modalities is a crucial problem in Audio-Visual Speech Recognition. To address it, state-synchronous multi-stream HMMs have been proposed, whose important advantage is the incorporation of stream reliability into the fusion scheme. This paper focuses on stream weight adaptation based on modality confidence estimators. We assume environment noise that varies in both type and level over time, as encountered in realistic applications, and for this setting adaptive methods are best-suited. Stream reliability is assessed directly from classifier outputs, since these are not specific to any particular noise type or level. The influence of constraining the weights to sum to one is also discussed.
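The fusion scheme described above can be illustrated with a minimal sketch: per-stream HMM state log-likelihoods are combined via exponent weights that sum to one, and the audio weight is set from per-stream confidence scores. The confidence-to-weight mapping below (simple normalisation) is a hypothetical placeholder, not the estimator proposed in the paper.

```python
def fused_log_likelihood(log_p_audio: float, log_p_video: float,
                         lambda_audio: float) -> float:
    """Multi-stream HMM score fusion: weighted sum of per-stream
    log-likelihoods, with the weights constrained to sum to one
    (lambda_video = 1 - lambda_audio)."""
    lambda_video = 1.0 - lambda_audio
    return lambda_audio * log_p_audio + lambda_video * log_p_video


def confidence_based_weight(audio_conf: float, video_conf: float) -> float:
    """Map per-stream classifier confidences to an audio stream weight
    by normalisation (hypothetical estimator for illustration only)."""
    return audio_conf / (audio_conf + video_conf)


# Example: noisy audio lowers audio confidence, shifting weight to video.
w = confidence_based_weight(0.2, 0.6)            # audio weight = 0.25
score = fused_log_likelihood(-10.0, -12.0, w)
```

Because the weights appear as exponents on the stream likelihoods, this combination is evaluated per HMM state at each frame, which is what allows the weighting to track time-varying noise.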