Abstract

Model-based approaches to Speaker Verification (SV), such as Joint Factor Analysis (JFA), i-vector and relevance Maximum-a-Posteriori (MAP), have been shown to provide state-of-the-art performance for text-dependent systems with fixed phrases. The performance of i-vector and JFA models has been further enhanced by estimating posteriors from a Deep Neural Network (DNN) instead of a Gaussian Mixture Model (GMM). While both DNNs and GMMs aim to incorporate phonetic information of the phrase through these posteriors, model-based SV approaches ignore the sequence information of the phonetic units of the phrase. In this paper, we address this issue by applying dynamic time warping with speaker-informative features. We propose to use i-vectors computed from short segments of each speech utterance, also called online i-vectors, as feature vectors. The proposed approach is evaluated on the RedDots database and yields a 75% relative improvement in equal error rate over the best model-based SV baseline system in a content-mismatch condition.
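As a rough illustration of the alignment step described above, the following is a minimal dynamic time warping sketch over two sequences of segment-level feature vectors (e.g. online i-vectors). The function name, the cosine local distance, and the path-length normalization are illustrative assumptions, not the paper's exact configuration:

```python
import numpy as np

def dtw_distance(X, Y):
    """DTW distance between two sequences of feature vectors (one row
    per segment), using a cosine local distance. Illustrative sketch
    only; the paper's exact distance and constraints may differ."""
    n, m = len(X), len(Y)
    # Cosine distance between every pair of segment vectors.
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    Yn = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    local = 1.0 - Xn @ Yn.T
    # Accumulated cost with the standard (match, insert, delete) recursion.
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            D[i, j] = local[i - 1, j - 1] + min(D[i - 1, j - 1],
                                                D[i - 1, j],
                                                D[i, j - 1])
    # Normalize by the sum of lengths so utterances of different
    # durations yield comparable scores.
    return D[n, m] / (n + m)
```

Because the warping path is monotonic, two utterances containing the same phonetic units in a different order accumulate cost, which is exactly the sequence information that frame-pooled model-based scoring discards.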

Details