Information theoretic clustering for unsupervised domain-adaptation

Dey, Subhadeep; Madikeri, Srikanth; Motlicek, Petr

doi:10.1109/ICASSP.2016.7472745

conference paper

Information theoretic clustering for unsupervised domain-adaptation

Dey, Subhadeep

•

Madikeri, Srikanth

•

Motlicek, Petr

2016

2016 Ieee International Conference On Acoustics, Speech And Signal Processing Proceedings

Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016)

The aim of the domain-adaptation task for speaker verification is to exploit unlabelled target domain data by using the labelled source domain data effectively. The i-vector based Probabilistic Linear Dis- criminant Analysis (PLDA) framework approaches this task by clus- tering the target domain data and using each cluster as a unique speaker to estimate PLDA model parameters. These parameters are then combined with the PLDA parameters from the source domain. Typically, agglomerative clustering with cosine distance measure is used. In tasks such as speaker diarization that also require unsuper- vised clustering of speakers, information-theoretic clustering mea- sures have been shown to be effective. In this paper, we employ the Information Bottleneck (IB) clustering technique to find speaker clusters in the target domain data. This is achieved by optimizing the IB criterion that minimizes the information loss during the cluster- ing process. The greedy optimization of the IB criterion involves ag- glomerative clustering using the Jensen-Shannon divergence as the distance metric. Our experiments in the domain-adaptation task in- dicate that the proposed system outperforms the baseline by about 14% relative in terms of equal error rate.

Type

conference paper

DOI

10.1109/ICASSP.2016.7472745

Web of Science ID

WOS:000388373405146

Author(s)

Dey, Subhadeep

Madikeri, Srikanth

Motlicek, Petr

Date Issued

2016

Publisher

IEEE

Publisher place

New York

Published in

2016 Ieee International Conference On Acoustics, Speech And Signal Processing Proceedings

ISBN of the book

978-1-4799-9988-0

Total of pages

5

Start page

5580

End page

5584

Subjects

Speaker verification

•

Domain adaptation

•

Information theoretic measures

•

PLDA model

URL

Event name	Event place
Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016)	Shanghai