Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. A Bayesian Approach To Inter-Task Fusion For Speaker Recognition
 
conference paper

A Bayesian Approach To Inter-Task Fusion For Speaker Recognition

Madikeri, Srikanth
•
Motlicek, Petr  
•
Dey, Subhadeep
January 1, 2019
2019 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp)
44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In i-vector based speaker recognition systems, back-end classifiers are trained to factor out nuisance information and retain only the speaker identity. As a result, variabilities arising due to gender, language and accent ( among many others) are suppressed. Inter-task fusion, in which such metadata information obtained from automatic systems is used, has been shown to improve speaker recognition performance. In this paper, we explore a Bayesian approach towards inter-task fusion. Speaker similarity score for a test recording is obtained by marginalizing the posterior probability of a speaker. Gender and language probabilities for the test audio are combined with speaker posteriors to obtain a final speaker score. The proposed approach is demonstrated for speaker verification and speaker identification tasks on the NIST SRE 2008 dataset. Relative improvements of up to 10% and 8% are obtained when fusing gender and language information, respectively.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2019.8683658
Web of Science ID

WOS:000482554006003

Author(s)
Madikeri, Srikanth
Motlicek, Petr  
Dey, Subhadeep
Date Issued

2019-01-01

Publisher

IEEE

Publisher place

New York

Published in
2019 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp)
ISBN of the book

978-1-4799-8131-1

Start page

5786

End page

5790

Subjects

inter-task fusion

•

bayesian fusion

•

speaker recognition

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent placeEvent date
44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Brighton, ENGLAND

May 12-17, 2019

Available on Infoscience
September 26, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/161541
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés