Multimodal Speaker Localization from Omnidirectional Videos

Reuse, Pascal; Gurban, Mihai; Austvoll, Ivar; Thiran, Jean-Philippe

conference paper

Multimodal Speaker Localization from Omnidirectional Videos

Reuse, Pascal

•

Gurban, Mihai

•

Austvoll, Ivar

more

2009

Proceedings of the 17th European Signal Processing Conference

17th European Signal Processing Conference

The use of omnidirectional cameras for videoconferencing promises to simplify the hardware setup necessary for large groups of participants. We investigate the use of a multimodal speaker detection algorithm on audio-visual sequences captured with such a camera, in particular, an algorithm that uses the audio energy together with the optical flow. We analyze several types of optical flow methods to determine the one which is appropriate to the omnidirectional context.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/42253

Name

omni.pdf

Access type

openaccess

Size

514.8 KB

Format

Adobe PDF

Checksum (MD5)

c621839fdebe6bf79f375e1a92133957