Automatic Analysis of Multimodal Group Actions in Meetings

McCowan, Iain A.; Gatica-Perez, Daniel; Bengio, Samy; Lathoud, Guillaume; Barnard, Mark; Zhang, Dong

McCowan, Iain A.; Gatica-Perez, Daniel; Bengio, Samy; Lathoud, Guillaume; Barnard, Mark; Zhang, Dong

2003

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

This paper investigates the recognition of group actions in meetings. A statistical framework is proposed in which group actions result from the interactions of the individual participants. The group actions are modelled using different HMM-based approaches, where the observations are provided by a set of audio-visual features monitoring the actions of individuals. Experiments demonstrate the importance of taking interactions into account in modelling the group actions. It is also shown that the visual modality contains useful information, even for predominantly audio-based events, motivating a multimodal approach to meeting analysis.

Details

Title Automatic Analysis of Multimodal Group Actions in Meetings

Author(s) McCowan, Iain A. ; Gatica-Perez, Daniel ; Bengio, Samy ; Lathoud, Guillaume ; Barnard, Mark ; Zhang, Dong

Date 2003

Publisher Martigny, Switzerland, IDIAP

Keywords

speech; vision; learning; mccowan; gatica; lathoud; bengio; zhang; barnard

Note To appear in IEEE Transactions of Pattern Analysis and Machine Intelligence

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2006-03-10

Actions

Preview

Select file: