Modified group delay feature based total variability space modelling for speaker recognition

Madikeri, Srikanth R.; Talambedu, Asha; Murthy, Hema A.

doi:10.1007/s10772-014-9243-7

Madikeri, Srikanth R.; Talambedu, Asha; Murthy, Hema A.

2015

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

In this paper, modified group delay (MODGD) features are used to model target speakers in the Total Variability Space (TVS) framework for speaker recognition. MODGD based features have been shown to improve speaker recognition performance owing to the ability of group delay functions to emphasise formants. The basis vectors of TVS are estimated using the PPCA algorithm while i-vectors for a speaker are extracted using the conventional technique. The estimation of the total variability space is simplified by a simple transformation of the supervectors. This results in a significant speed up in the estimation of hyperparameters of TVS as the computational complexity of PPCA algorithm is simpler compared to that of the conventaional procedure. This is important as the estimation procedure needs to handle large amounts data for estimation. The technique has already been shown to provide a speed up of 16×. The performance of the MODGD-based system is compared with that of the MFCC based system on the NIST SRE 2010 benchmark dataset. Two types of fusions are tested in this work—systems fused at the i-vector level and at the score level. A considerable performance improvement is observed in terms of the EER (Equal Error Rate) by employing these fusion techniques. A robust speaker recognition system with decreased development time is obtained as a result.

Détails

Titre Modified group delay feature based total variability space modelling for speaker recognition

Auteur(s) Madikeri, Srikanth R. ; Talambedu, Asha ; Murthy, Hema A.

Publié dans International Journal of Speech Technology

Volume 18

Numéro 1

Pages 17-23

Date 2015

ISSN 1572-8110

DOI https://doi.org/10.1007/s10772-014-9243-7

Laboratoires LIDIAP

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Production scientifique et compétences > Euler Center for Signal Processing
Publications validées par des pairs
Travail produit à l'EPFL
Articles de journaux
Publié

Date de création de la notice 2014-09-18

Résumé

Détails

Actions