conference paper

Multistream Speaker Diarization beyond Two Acoustic Feature Streams

Vijayasenan, Deepu; Valente, Fabio; Bourlard, Hervé
2010
2010 IEEE International Conference on Acoustics, Speech and Signal Processing
International Conference on Acoustics, Speech, and Signal Processing

Speaker diarization systems for meeting data have recently been converging towards multistream approaches. The most common complementary features used in combination with MFCC are Time Delay of Arrival (TDOA) features. Other features have also been proposed, although no improvements over MFCC+TDOA systems have been reported. In this work we investigate the combination of other feature sets with MFCC+TDOA. We discuss the issues and problems related to the weighting of four different streams and propose a solution based on a smoothed version of the speaker error. Experiments are presented on the NIST RT06 meeting diarization evaluation. Results show that combining four acoustic feature streams yields a 30% relative improvement over the MFCC+TDOA feature combination. To the authors’ best knowledge, this is the first successful attempt to improve the MFCC+TDOA baseline by including other feature streams.
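
The abstract does not state how the four streams are combined or how the weights are chosen. As a rough illustration only, the sketch below assumes a weighted sum of per-stream scores with non-negative weights that sum to one, and shows how a relative improvement figure is computed from two error rates. The function names and all numeric values are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def combine_streams(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum of per-stream scores (assumed combination rule, not the paper's)."""
    if len(scores) != len(weights):
        raise ValueError("need exactly one weight per stream")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to one")
    return sum(w * s for w, s in zip(weights, scores))


def relative_improvement(baseline_error: float, new_error: float) -> float:
    """Fraction by which new_error is lower than baseline_error."""
    if baseline_error <= 0:
        raise ValueError("baseline error must be positive")
    return (baseline_error - new_error) / baseline_error


# Hypothetical scores for four streams, with equal weights.
combined = combine_streams([1.0, 2.0, 3.0, 4.0], [0.25, 0.25, 0.25, 0.25])

# Hypothetical error rates: a drop from 10.0 to 7.0 is a 30% relative improvement.
gain = relative_improvement(10.0, 7.0)
```

With more than two streams the weights can no longer be tuned by sweeping a single parameter, which is one reason the weighting of four streams needs separate treatment.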

Type: conference paper
DOI: 10.1109/ICASSP.2010.5495086
Author(s): Vijayasenan, Deepu; Valente, Fabio; Bourlard, Hervé
Date Issued: 2010
Published in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing
Start page: 4950
End page: 4953
Subjects: Speaker Diarization
Written at: EPFL
EPFL units: LIDIAP
Event name: International Conference on Acoustics, Speech, and Signal Processing
Available on Infoscience: February 11, 2010
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/47182