Abstract

In the last decade, i-vector and Joint Factor Analysis (JFA) approaches to speaker modeling have become ubiquitous in automatic speaker recognition. Both techniques involve computing posterior probabilities, using either Gaussian Mixture Models (GMM) or Deep Neural Networks (DNN), as a prior step to estimating i-vectors or speaker factors. GMMs implicitly model the phonetic information in the acoustic features, while DNNs explicitly model phonetic/linguistic units. For text-dependent speaker verification, DNN-based systems have considerably outperformed GMM-based systems on fixed-phrase tasks. However, both approaches ignore phone sequence information. In this paper, we aim to exploit this information by using Dynamic Time Warping (DTW) with speaker-informative features. These features are i-vectors extracted over short speech segments, also called online i-vectors. Probabilistic Linear Discriminant Analysis (PLDA) is further used to project the online i-vectors onto a speaker-discriminative subspace. The proposed DTW approach obtained at least a 74% relative improvement in equal error rate on the RSR corpus over other state-of-the-art approaches, including i-vector and JFA.
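As a rough illustration of the scoring step described above (not the authors' exact recipe), the sketch below aligns an enrollment and a test sequence of online i-vectors with standard DTW. The function names, the choice of cosine distance, and the path-length normalization are assumptions made for this example; in the paper the vectors would first be projected with PLDA onto a speaker-discriminative subspace.

```python
import numpy as np

def cosine_distance(u, v):
    """Cosine distance between two (projected) online i-vectors."""
    return 1.0 - np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12)

def dtw_score(enroll, test):
    """Align two sequences of online i-vectors with DTW and return the
    accumulated distance along the optimal warping path, normalized by
    the sequence lengths (a common, assumed normalization).

    enroll: array of shape (n, dim), one online i-vector per row.
    test:   array of shape (m, dim), one online i-vector per row.
    Lower scores indicate a better match of speaker and phrase.
    """
    n, m = len(enroll), len(test)
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = cosine_distance(enroll[i - 1], test[j - 1])
            acc[i, j] = d + min(acc[i - 1, j],      # skip a test segment
                                acc[i, j - 1],      # skip an enrollment segment
                                acc[i - 1, j - 1])  # match both segments
    return acc[n, m] / (n + m)

# Hypothetical usage with random data standing in for online i-vectors:
# enroll = np.random.randn(20, 400); test = np.random.randn(25, 400)
# print(dtw_score(enroll, test))
```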
