Floor Holder Detection and End of Speaker Turn Prediction in Meetings

Dielmann, Alfred; Garau, Giulia; Bourlard, Hervé

doi:10.21437/Interspeech.2010-632

Dielmann, Alfred; Garau, Giulia; Bourlard, Hervé

2010

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We propose a novel fully automatic framework to detect which meeting participant is currently holding the conversational floor and when the current speaker turn is going to finish. Two sets of experiments were conducted on a large collection of multiparty conversations: the AMI meeting corpus. Unsupervised speaker turn detection was performed by post-processing the speaker diarization and the speech activity detection outputs. A supervised end-of-speaker-turn prediction framework, based on Dynamic Bayesian Networks and automatically extracted multimodal features (related to prosody, overlapping speech, and visual motion), was also investigated. These novel approaches resulted in good floor holder detection rates (13:2% Floor Error Rate), attaining state of the art end-of-speaker-turn prediction performances.

Details

Title Floor Holder Detection and End of Speaker Turn Prediction in Meetings

Author(s) Dielmann, Alfred ; Garau, Giulia ; Bourlard, Hervé

Published in Interspeech 2010

Pages 2306-2309

Conference International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan

Date 2010

Publisher ISCA

Keywords

Dynamic Bayesian Network; floor control; Multiparty Conversation; non-verbal features; speaker turn

DOI https://doi.org/10.21437/Interspeech.2010-632

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2010-08-26

Actions

Preview

Select file: