Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization

Le, Nam; Odobez, Jean-Marc

doi:10.21437/Interspeech.2018-1685

conference paper

Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization

Le, Nam

•

Odobez, Jean-Marc

2018

19Th Annual Conference Of The International Speech Communication Association (Interspeech 2018), Vols 1-6

Proceedings of Interspeech

Learning a good speaker embedding is critical for many speech processing tasks, including recognition, verification, and diarization. To this end, we propose a complementary optimizing goal called intra-class loss to improve deep speaker embed dings learned with triplet loss. This loss function is formulated as a soft constraint on the averaged pair-wise distance between samples from the same class. Its goal is to prevent the scattering of these samples within the embedding space to increase the intra-class compactncss.When intra-class loss is jointly optimized with triplet loss, we can observe 2 major improvements: the deep embedding network can achieve a more robust and discriminative representation and the training process is more stable with a faster convergence rate. We conduct experiments on 2 large public benchmarking datasets for speaker verification, VoxCeleb and VoxForge. The results show that intra-class loss helps accelerating the convergence of deep network training and significantly improves the overall performance of the resulted embeddings.

Type

conference paper

DOI

10.21437/Interspeech.2018-1685

Web of Science ID

WOS:000465363900473

Author(s)

Le, Nam

Odobez, Jean-Marc

Date Issued

2018

Publisher

ISCA-INT SPEECH COMMUNICATION ASSOC

Publisher place

Baixas

Published in

19Th Annual Conference Of The International Speech Communication Association (Interspeech 2018), Vols 1-6

ISBN of the book

978-1-5108-7221-9

Series title/Series vol.

Interspeech

Start page

2257

End page

2261

Subjects

speaker verification

•

deep neural networks

•

embedding learning

•

triplet loss

URL

Event name	Event place	Event date
Proceedings of Interspeech	Hyderabad, INDIA	Aug 02-Sep 06, 2018