conference paper

Estimating the dominant person in multi-party conversations using speaker diarization strategies

Hung, Hayley
•
Huang, Yan
•
Friedland, Gerald
•
Gatica-Perez, Daniel
2008
2008 IEEE International Conference on Acoustics, Speech and Signal Processing
IEEE International Conference on Acoustics, Speech, and Signal Processing

In this paper, we apply single-source speaker diarization strategies to the task of estimating the dominant person in a group meeting. Previous work has shown that speaking length is strongly correlated with perceived dominance. Here we investigate this in more depth by considering two dominance tasks: one where there is full agreement amongst ground-truth annotators, and one where there is majority agreement. In addition, we investigate how 24 different speed-up and algorithmic strategies and source types affect dominance estimation. We obtained the best performance of 77% using our slowest scheme and a single distant microphone (SDM). Within the top 3 of the 24 experiments in both dominance tasks, we show that we can use the furthest SDM with no prior knowledge of the number of speakers, together with the fastest diarization scheme, which runs 1.3 times faster than real time.
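The link between speaking length and dominance described in the abstract can be illustrated with a minimal sketch. It assumes that a diarization system has already produced labeled segments of the form (speaker, start, end); the `Segment` type, the function names, and the speaker labels are hypothetical and are not taken from the paper. The sketch picks the speaker with the largest total speaking length, which is one simple way to turn diarization output into a dominance estimate and is not necessarily the exact procedure the authors used.

```python
from collections import defaultdict
from typing import Dict, Iterable, NamedTuple


class Segment(NamedTuple):
    """One diarization segment (hypothetical format, not from the paper)."""
    speaker: str  # cluster label assigned by the diarization system
    start: float  # segment start time in seconds
    end: float    # segment end time in seconds


def total_speaking_length(segments: Iterable[Segment]) -> Dict[str, float]:
    """Sum the duration of all segments attributed to each speaker."""
    totals: Dict[str, float] = defaultdict(float)
    for seg in segments:
        if seg.end < seg.start:
            raise ValueError(f"segment ends before it starts: {seg}")
        totals[seg.speaker] += seg.end - seg.start
    return dict(totals)


def estimate_dominant_speaker(segments: Iterable[Segment]) -> str:
    """Return the speaker with the largest total speaking length.

    Ties are broken alphabetically by speaker label so the result is
    deterministic.
    """
    totals = total_speaking_length(segments)
    if not totals:
        raise ValueError("no segments given")
    return max(sorted(totals), key=totals.__getitem__)


# Made-up example segments for a short meeting excerpt.
example_segments = [
    Segment("spk_a", 0.0, 12.5),
    Segment("spk_b", 12.5, 20.0),
    Segment("spk_a", 20.0, 31.0),
    Segment("spk_c", 31.0, 35.0),
]

dominant = estimate_dominant_speaker(example_segments)
```

With the made-up segments above, `spk_a` accumulates 23.5 seconds of speech against 7.5 and 4.0 seconds for the others, so it is returned as the estimated dominant speaker. Because the estimate depends only on per-speaker totals, it needs no knowledge of who the speakers are, only a consistent cluster label for each.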

Type
conference paper
DOI
10.1109/ICASSP.2008.4518080
Author(s)
Hung, Hayley
Huang, Yan
Friedland, Gerald
Gatica-Perez, Daniel  
Date Issued
2008
Published in
2008 IEEE International Conference on Acoustics, Speech and Signal Processing
Start page
2197
End page
2200
URL
Related documents
http://publications.idiap.ch/index.php/publications/showcite/hung:rr07-60
Written at
EPFL
EPFL units
LIDIAP
Event name
IEEE International Conference on Acoustics, Speech, and Signal Processing
Available on Infoscience
February 11, 2010
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/46829