Estimating the dominant person in multi-party conversations using speaker diarization strategies

Hung, Hayley; Huang, Yan; Friedland, Gerald; Gatica-Perez, Daniel

doi:10.1109/ICASSP.2008.4518080

Hung, Hayley; Huang, Yan; Friedland, Gerald; Gatica-Perez, Daniel

2008

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

In this paper, we apply speaker diarization strategies from a single source to the task of estimating the dominant person in a group meeting. Previous work has shown that speaking length is strongly correlated with perceived dominance. Here we investigate this in more depth by considering two dominance tasks where there is full and majority agreement amongst ground-truth annotators. In addition, we investigate how 24 different speed-up and algorithmic strategies, and source types lead to interesting outcomes when applied to dominance estimation. We obtained the best performance of $77\%$ using our slowest scheme and a single distant microphone (SDM). Within the top 3 out of 24 performing experiments in both dominance tasks, we show that we can use the furthest SDM, with no prior knowledge of the number of speakers and the fastest diarization scheme, which performs $1.3$ times faster than real-time.

Details

Title Estimating the dominant person in multi-party conversations using speaker diarization strategies

Author(s) Hung, Hayley ; Huang, Yan ; Friedland, Gerald ; Gatica-Perez, Daniel

Published in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pages 2197-2200

Conference IEEE International Conference on Acoustics, Speech, and Signal Processing

Date 2008

DOI https://doi.org/10.1109/ICASSP.2008.4518080

Additional link Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2010-02-11

Abstract

Details

Actions