Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues

Ba, Silèye O.; Odobez, Jean-Marc

doi:10.1109/ICASSP.2008.4518086

Ba, Silèye O.; Odobez, Jean-Marc

2008

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We address the problem of recognizing the visual focus of attention (VFOA) of meeting participants from their head pose and contextual cues. The main contribution of the paper is the use of a head pose posterior distribution as a representation of the head pose information contained in the image data. This posterior encodes the probabilities of the different head poses given the image data, and constitute therefore a richer representation of the data than the mean or the mode of this distribution, as done in all previous work. These observations are exploited in a joint interaction model of all meeting participants pose observations, VFOAs, speaking status and of environmental contextual cues. Numerical experiments on a public database of 4 meetings of 22min on average show that this change of representation allows for a 5.4% gain with respect to the standard approach using head pose as observation.

Details

Title Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues

Author(s) Ba, Silèye O. ; Odobez, Jean-Marc

Published in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pages 2221-2224

Date 2008

Note IDIAP-RR 07-50

DOI https://doi.org/10.1109/ICASSP.2008.4518086

Additional link URL; Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2010-02-11

Actions

Preview

Select file: