Multimodal Integration for Meeting Group Action Segmentation and Recognition

Al-Hames, Marc; Dielmann, Alfred; Gatica-Perez, Daniel; Reiter, Stephan; Renals, Steve; Zhang, Dong

Al-Hames, Marc; Dielmann, Alfred; Gatica-Perez, Daniel; Reiter, Stephan; Renals, Steve; Zhang, Dong

2005

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We address the problem of segmentation and recognition of sequences of multimodal human interactions in meetings. These interactions can be seen as a rough structure of a meeting, and can be used either as input for a meeting browser or as a first step towards a higher semantic analysis of the meeting. A common lexicon of multimodal group meeting actions, a shared meeting data set, and a common evaluation procedure enable us to compare the different approaches. We compare three different multimodal feature sets and four modelling infrastructures: a higher semantic feature approach, multi-layer HMMs, a multi-stream DBN, as well as a multi-stream mixed-state DBN for disturbed data.

Details

Title Multimodal Integration for Meeting Group Action Segmentation and Recognition

Author(s) Al-Hames, Marc ; Dielmann, Alfred ; Gatica-Perez, Daniel ; Reiter, Stephan ; Renals, Steve ; Zhang, Dong

Date 2005

Publisher Martigny, Switzerland, IDIAP

Keywords

vision; zhang

Note Published in ``MLMI'', July, 2005

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2006-03-10

Actions

Preview

Select file: