Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings
 
conference paper

Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings

Dawalatabad, Nauman
•
Madikeri, Srikanth
•
Murthy, Hema A
Show more
2019
Proceedings of ICASSP 2019
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

The two-pass information bottleneck (TPIB) based speaker diarization system operates independently on different conversational recordings. TPIB system does not consider previously learned speaker discriminative information while di-arizing new conversations. Hence, the real time factor (RTF) of TPIB system is high owing to the training time required for the artificial neural network (ANN). This paper attempts to improve the RTF of the TPIB system using an incremental transfer learning approach where the parameters learned by the ANN from other conversations are updated using current conversation rather than learning parameters from scratch. This reduces the RTF significantly. The effectiveness of the proposed approach compared to the baseline IB and the TPIB systems is demonstrated on standard NIST and AMI conversational meeting datasets. With a minor degradation in performance, the proposed system shows a significant improvement of 33.07% and 24.45% in RTF with respect to TPIB system on the NIST RT-04Eval and AMI-1 datasets, respectively.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2019.8683114
Author(s)
Dawalatabad, Nauman
Madikeri, Srikanth
Murthy, Hema A
Sekhar, C Chandra
Date Issued

2019

Published in
Proceedings of ICASSP 2019
Start page

6291

End page

6295

Written at

EPFL

EPFL units
LIDIAP  
Event name
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Available on Infoscience
February 18, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/166316
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés