Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Reports, Documentation, and Standards
  4. The Speed Submission to DIHARD II: Contributions & Lessons Learned
 
report

The Speed Submission to DIHARD II: Contributions & Lessons Learned

Sahidullah, Md
•
Patino, Jose
•
Cornell, Samuele
Show more
2019

This paper describes the speaker diarization systems developed for the Second DIHARD Speech Diarization Challenge (DIHARD II) by the Speed team. Besides describing the system, which considerably outperformed the challenge baselines, we also focus on the lessons learned from numerous approaches that we tried for single and multi-channel systems. We present several components of our diarization system, including categorization of domains, speech enhancement, speech activity detection, speaker embeddings, clustering methods, resegmentation, and system fusion. We analyze and discuss the effect of each such component on the overall diarization performance within the realistic settings of the challenge.

  • Details
  • Metrics
Type
report
Author(s)
Sahidullah, Md
Patino, Jose
Cornell, Samuele
Yin, Ruiqing
Sivasankaran, Sunit
Bredin, Herve
Korshunov, Pavel
Brutti, Alessio
Serizel, Romain
Vincent, Emmanuel
Show more
Date Issued

2019

Publisher

Idiap

Subjects

diarization

•

DIHARD challenge

•

evaluation

•

single-channel and multi-channel speech

URL
http://publications.idiap.ch/downloads/reports/2019/Sahidullah_Idiap-RR-14-2019.pdf
Written at

EPFL

EPFL units
LIDIAP  
Available on Infoscience
February 18, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/166350
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés