Multimodal Speaker Localization from Omnidirectional Videos

The use of omnidirectional cameras for videoconferencing promises to simplify the hardware setup necessary for large groups of participants. We investigate a multimodal speaker detection algorithm applied to audio-visual sequences captured with such a camera; specifically, the algorithm combines audio energy with optical flow. We analyze several optical flow methods to determine which one is appropriate for the omnidirectional context.


Published in:
Proceedings of the 17th European Signal Processing Conference
Presented at:
17th European Signal Processing Conference, Glasgow, UK, August 24-28, 2009

 Record created 2009-08-27, last modified 2018-09-13
