Real-Time Audio-Visual Analysis for Multiperson Videoconferencing

Motlicek, Petr; Duffner, Stefan; Korchagin, Danil; Bourlard, Hervé; Scheffler, Carl; Odobez, Jean-Marc; Del Galdo, Giovanni; Kallinger, Markus; Thiergart, Oliver

doi:10.1155/2013/175745

Motlicek, Petr; Duffner, Stefan; Korchagin, Danil; Bourlard, Hervé; Scheffler, Carl; Odobez, Jean-Marc; Del Galdo, Giovanni; Kallinger, Markus; Thiergart, Oliver

2013

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We describe the design of a system consisting of several state-of-the-art real-time audio and video processing components enabling multimodal stream manipulation (e.g., automatic online editing for multiparty videoconferencing applications) in open, unconstrained environments. The underlying algorithms are designed to allow multiple people to enter, interact, and leave the observable scene with no constraints.They comprise continuous localisation of audio objects and its application for spatial audio object coding, detection, and tracking of faces, estimation of head poses and visual focus of attention, detection and localisation of verbal and paralinguistic events, and the association and fusion of these different events. Combined all together, they represent multimodal streams with audio objects and semantic video objects and provide semantic information for stream manipulation systems (like a virtual director). Various experiments have been performed to evaluate the performance of the system.The obtained results demonstrate the effectiveness of the proposed design, the various algorithms, and the benefit of fusing different modalities in this scenario.

Details

Title Real-Time Audio-Visual Analysis for Multiperson Videoconferencing

Author(s) Motlicek, Petr ; Duffner, Stefan ; Korchagin, Danil ; Bourlard, Hervé ; Scheffler, Carl ; Odobez, Jean-Marc ; Del Galdo, Giovanni ; Kallinger, Markus ; Thiergart, Oliver

Published in Advances in Multimedia

Volume 2013

Pages 175745

Date 2013

Note Hindawi Publishing Corporation, Article ID 175745

DOI https://doi.org/10.1155/2013/175745

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2013-12-19

Actions

Preview

Select file: