
Infoscience

report

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization

Vijayasenan, Deepu • Valente, Fabio • Bourlard, Hervé
2010

This work describes a novel system for speaker diarization of meeting recordings based on the combination of acoustic features (MFCC) and Time Delay of Arrival (TDOA) features. The first part of the paper analyzes the differences between MFCC and TDOA features, which have completely different statistical properties. When Gaussian Mixture Models (GMMs) are used, experiments reveal that the diarization system is sensitive to the recording scenario (i.e., meeting rooms with a varying number of microphones). In the second part, a new multistream diarization system is proposed, extending previous work on Information Theoretic diarization. Both the speaker clustering and speaker realignment steps are discussed; in contrast to current systems, the proposed method does not combine the features by averaging log-likelihood scores. Experiments on meeting data reveal that the proposed approach outperforms the GMM-based system when recordings are made with a varying number of microphones.
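
For illustration, the log-likelihood averaging that the abstract attributes to current systems can be sketched as below. This is a minimal sketch of that conventional baseline, not the method proposed in the report. It rests on several assumptions made only for the example: each speaker is modeled by a single diagonal Gaussian per stream rather than a full GMM, and the function names (`diag_gaussian_logpdf`, `combined_score`, `assign_frame`), the stream weight `w_mfcc=0.9`, and all toy parameters are hypothetical.

```python
import math


def diag_gaussian_logpdf(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def combined_score(mfcc, tdoa, speaker, w_mfcc=0.9):
    """Weighted average of the per-stream log-likelihoods for one frame.

    w_mfcc is a hypothetical stream weight; the TDOA stream gets 1 - w_mfcc.
    """
    ll_mfcc = diag_gaussian_logpdf(mfcc, *speaker["mfcc"])
    ll_tdoa = diag_gaussian_logpdf(tdoa, *speaker["tdoa"])
    return w_mfcc * ll_mfcc + (1.0 - w_mfcc) * ll_tdoa


def assign_frame(mfcc, tdoa, speakers, w_mfcc=0.9):
    """Return the name of the speaker model with the highest combined score."""
    return max(
        speakers,
        key=lambda name: combined_score(mfcc, tdoa, speakers[name], w_mfcc),
    )


# Toy speaker models: (mean, variance) per stream. All values are made up.
speakers = {
    "A": {"mfcc": ([0.0, 0.0], [1.0, 1.0]), "tdoa": ([1.0], [0.5])},
    "B": {"mfcc": ([2.0, 2.0], [1.0, 1.0]), "tdoa": ([-1.0], [0.5])},
}

print(assign_frame([0.1, -0.2], [0.9], speakers))
```

In a scheme like this, the dimension of the TDOA vector depends on the number of microphones, so the scale of its log-likelihood presumably changes from room to room and a fixed stream weight may not transfer. That would be consistent with the sensitivity the abstract reports, although the abstract itself does not state the cause.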

Type: report
Author(s): Vijayasenan, Deepu; Valente, Fabio; Bourlard, Hervé
Date Issued: 2010
Publisher: Idiap
Written at: EPFL
EPFL units: LIDIAP
Available on Infoscience: August 26, 2010
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/52484

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved