Multimodal Speaker Localization from Omnidirectional Videos

Reuse, Pascal; Gurban, Mihai; Austvoll, Ivar; Thiran, Jean-Philippe

Reuse, Pascal; Gurban, Mihai; Austvoll, Ivar; Thiran, Jean-Philippe

2009

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

The use of omnidirectional cameras for videoconferencing promises to simplify the hardware setup necessary for large groups of participants. We investigate the use of a multimodal speaker detection algorithm on audio-visual sequences captured with such a camera, in particular, an algorithm that uses the audio energy together with the optical flow. We analyze several types of optical flow methods to determine the one which is appropriate to the omnidirectional context.

Details

Title Multimodal Speaker Localization from Omnidirectional Videos

Author(s) Reuse, Pascal ; Gurban, Mihai ; Austvoll, Ivar ; Thiran, Jean-Philippe

Published in Proceedings of the 17th European Signal Processing Conference

Conference 17th European Signal Processing Conference, Glasgow, UK, August 24-28, 2009

Date 2009

Keywords

LTS5

Laboratories LTS5

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LTS5 - Signal Processing Laboratory 5
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2009-08-27

Actions

Preview

Select file: