Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION
 
conference paper

INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION

Madikeri, Srikanth
•
Ferras, Marc
•
Motlicek, Petr
Show more
2017
2017 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp)
Proceedings of International Conference on Acoustics, Speech and Signal Processing

Multi-session training conditions are becoming increasingly common in recent benchmark datasets for both text-independent and text-dependent speaker verification. In the state-of-the-art i-vector framework for speaker verification, such conditions are addressed by simple techniques such as averaging the individual i-vectors, averaging scores, or modifying the Probabilistic Linear Discriminant Analysis (PLDA) scoring hypothesis for multi-session enrollment. The aforementioned techniques fail to exploit the speaker variabilities observed in the enrollment data for target speakers. In this paper, we propose to exploit the multi-session training data by estimating a speaker-dependent covariance matrix and updating the intra-speaker covariance during PLDA scoring for each target speaker. The proposed method is further extended by combining covariance adaptation and score averaging. In this method, the individual examples of the target speaker are compared against the test data as opposed to an averaged i-vector, and the scores obtained are then averaged. The proposed methods are evaluated on the NIST SRE 2012 dataset. Relative improvements of up to 29% in equal error rate are obtained.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2017.7953181
Web of Science ID

WOS:000414286205105

Author(s)
Madikeri, Srikanth
Ferras, Marc
Motlicek, Petr
Dey, Subhadeep
Date Issued

2017

Publisher

Ieee

Publisher place

New York

Published in
2017 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp)
ISBN of the book

978-1-5090-4117-6

Total of pages

5

Start page

5365

End page

5369

Subjects

i-vectors

•

PLDA

•

multi-session training

URL

Related documents

http://publications.idiap.ch/index.php/publications/showcite/Madikeri_Idiap-RR-05-2017
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event name
Proceedings of International Conference on Acoustics, Speech and Signal Processing
Available on Infoscience
December 19, 2017
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/143479
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés