Unknown-Multiple Speaker clustering using HMM

Ajmera, Jitendra; Bourlard, Hervé; Lapidot, I.; McCowan, Iain A.

Ajmera, Jitendra; Bourlard, Hervé; Lapidot, I.; McCowan, Iain A.

2002

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Fichiers

Résumé

An HMM-based speaker clustering framework is presented, where the number of speakers and segmentation boundaries are unknown \emph{a priori}. Ideally, the system aims to create one pure cluster for each speaker. The HMM is ergodic in nature with a minimum duration topology. The final number of clusters is determined automatically by merging closest clusters and retraining this new cluster, until a decrease in likelihood is observed. In the same framework, we also examine the effect of using only the features from highly voiced frames as a means of improving the robustness and computational complexity of the algorithm. The proposed system is assessed on the 1996 HUB-4 evaluation test set in terms of both cluster and speaker purity. It is shown that the number of clusters found often correspond to the actual number of speakers.

Détails

Titre Unknown-Multiple Speaker clustering using HMM

Auteur(s) Ajmera, Jitendra ; Bourlard, Hervé ; Lapidot, I. ; McCowan, Iain A.

Date 2002

Editeur Martigny, Switzerland, IDIAP

Mots-clés (libres)

speech; ajmera; bourlard; lapidot; mccowan

Note ICSLP, Denver, Colorado, 2002

Lien supplémentaire URL

Laboratoires LIDIAP

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Production scientifique et compétences > Euler Center for Signal Processing
Travail produit à l'EPFL
Rapports techniques
Publié

Date de création de la notice 2006-03-10

Actions

Aperçu

Sélectionner le fichier :