Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment

Asaei, Afsaneh; Bourlard, Hervé; Garner, Philip N.

doi:10.21437/Interspeech.2010-490

Asaei, Afsaneh; Bourlard, Hervé; Garner, Philip N.

2010

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Résumé

Sparse Component Analysis is a relatively young technique that relies upon a representation of signal occupying only a small part of a larger space. Mixtures of sparse components are disjoint in that space. As a particular application of sparsity of speech signals, we investigate the DUET blind source separation algorithm in the context of speech recognition for multi-party recordings. We show how DUET can be tuned to the particular case of speech recognition with interfering sources, and evaluate the limits of performance as the number of sources increases. We show that the separated speech fits a common metric for sparsity, and conclude that sparsity assumptions lead to good performance in speech separation and hence ought to benefit other aspects of the speech recognition chain.

Détails

Titre Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment

Auteur(s) Asaei, Afsaneh ; Bourlard, Hervé ; Garner, Philip N.

Publié dans Interspeech 2010

Pages 1704-1707

Présenté à Interspeech, Makuhari, Japan

Date 2010

Mots-clés (libres)

Automatic Speech Recognition; Overlapping Speech; Sparse Component Analysis

DOI https://doi.org/10.21437/Interspeech.2010-490

Lien supplémentaire URL

Laboratoires LIDIAP

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Production scientifique et compétences > Euler Center for Signal Processing
Papiers de conférence
Travail produit à l'EPFL
Publié

Date de création de la notice 2010-08-26

Files

Résumé

Détails

PDF