Abstract

Communication between humans relies deeply on the ability to express and recognize feelings. Research on human-machine interaction therefore needs to focus on the recognition and simulation of emotional states, a prerequisite of which is the collection of affective corpora. Currently available datasets remain a bottleneck because of the difficulties that arise during the acquisition and labeling of affective data. In this work, we present a new audio-visual corpus covering what are arguably the two most important modalities humans use to communicate their emotional states: speech and facial expression, the latter in the form of dense dynamic 3-D face geometries. We acquire high-quality data in a controlled environment and use video clips to induce affective states. The annotation of the speech signal includes a transcription of the corpus text into its phonological representation, accurate phone segmentation, fundamental frequency extraction, and signal intensity estimation. We employ a real-time 3-D scanner to acquire dense dynamic facial geometries and track the faces throughout the sequences, achieving full spatial and temporal correspondence. The corpus is a valuable tool for applications such as affective visual speech synthesis and view-independent facial expression recognition.
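
As a rough illustration of the kind of acoustic annotation described above, the sketch below extracts a fundamental frequency contour and a frame-wise intensity estimate from a single recording. It assumes a Python environment with librosa installed; the file name "utterance_001.wav" and the chosen frequency range are placeholders for illustration, not part of the corpus tooling.

    import librosa
    import numpy as np

    # Load one utterance at its native sampling rate (file name is a placeholder).
    audio, sr = librosa.load("utterance_001.wav", sr=None)

    # Fundamental frequency (F0) via the probabilistic YIN algorithm;
    # the range 65-400 Hz is a generic choice for adult speech.
    f0, voiced_flag, voiced_prob = librosa.pyin(audio, fmin=65.0, fmax=400.0, sr=sr)

    # Signal intensity approximated by frame-wise RMS energy, converted to dB.
    rms = librosa.feature.rms(y=audio)[0]
    intensity_db = librosa.amplitude_to_db(rms, ref=np.max)

    # Frame time stamps, useful for aligning the contours with phone boundaries.
    times = librosa.times_like(f0, sr=sr)
    print(f"{len(times)} frames, mean F0 of voiced frames: {np.nanmean(f0):.1f} Hz")
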
