Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles

Sapru, A.; Valente, Fabio

doi:10.1109/ICASSP.2012.6289057

Sapru, A.; Valente, Fabio

2012

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

This work aims at investigating the automatic recognition of speaker role in meeting conversations from the AMI corpus. Two types of roles are considered: formal roles, fixed over the meeting duration and recognized at recording level, and social roles related to the way participants interact between themselves, recognized at speaker turn level. Various structural, lexical and prosodic features as well as Dialog Act tags are exhaustively investigated and combined for this purpose. Results reveal an accuracy of 74% in recognizing the speakers formal roles and an accuracy of 66% (percentage of time) in correctly labeling the social roles. Feature analysis reveals that lexical features provide the higher performances in formal/functional role recognition while prosodic features provide the higher performances in social role recognition. Furthermore results reveal that social role recognition in case of rare roles in the corpus can be improved through the use of lexical and Dialog Act information combined over short time windows.

Details

Title Automatic Speaker Role Labeling in AMI Meetings: Recognition of Formal and Social Roles

Author(s) Sapru, A. ; Valente, Fabio

Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pages 5057-5060

Conference IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 25-30 March 2012

Date 2012

Keywords

AMI Meetings; Speaker Role Labeling

DOI https://doi.org/10.1109/ICASSP.2012.6289057

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL

Record creation date 2013-12-19

Actions

Preview

Select file: