Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Improved overlap speech diarization of meeting recordings using long-term conversational features
 
conference paper

Improved overlap speech diarization of meeting recordings using long-term conversational features

Yella, Sree Harsha  
•
Bourlard, Hervé
2013
2013 IEEE International Conference on Acoustics, Speech and Signal Processing
ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Overlapping speech is a source of significant errors in speaker diarization of spontaneous meeting recordings. Recent works on speaker diarization have attempted to solve the problem of overlap detection using classifiers trained on acoustic and spatial features. This paper proposes a method to improve the short-term spectral feature based overlap detector by incorporating information from long-term conversational features in the form of speaker change statistics. The statistics are obtained at segment level(around few seconds) from the output of a diarization system. The approach is motivated by the observation that segments containing more speaker changes are more probable to have more overlaps. Experiments on AMI meeting corpus reveal that the number of overlaps in a segment follows a Poisson distribution whose rate is directly proportional to the number of speaker changes in the segment. When this information is combined with acoustic information in an HMM/GMM overlap detector, improvements are verified in terms of F-measure and consequently, diarization error (DER) is reduced by 5% relative to the baseline overlap detector.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2013.6639171
Author(s)
Yella, Sree Harsha  
Bourlard, Hervé
Date Issued

2013

Publisher

IEEE

Published in
2013 IEEE International Conference on Acoustics, Speech and Signal Processing
ISBN of the book

978-1-4799-0356-6

Start page

7746

End page

7750

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent placeEvent date
ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Vancouver, BC, Canada

26-31 05 2013

Available on Infoscience
December 19, 2013
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/98541
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés