Unsupervised Spectral Subtraction for Noise-Robust ASR

Lathoud, Guillaume; Magimai.-Doss, Mathew; Mesot, Bertrand; Bourlard, Hervé

2005

Abstract

This paper proposes a simple, computationally efficient 2-mixture model approach to discriminate between speech and background noise at the magnitude spectrogram level. It is derived directly from observations on real data and can be trained in a fully unsupervised manner with the EM algorithm. In this paper, the 2-mixture model is used in an "Unsupervised Spectral Subtraction" scheme that can be applied as a pre-processing step for any acoustic feature extraction scheme, such as MFCCs or PLP. The goal is to improve the noise robustness of the acoustic features. Experimental results on both the OGI Numbers 95 and Aurora 2 tasks yielded a major improvement on all noise conditions, while retaining similar performance on clean conditions.
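The idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the mixture components, so the code below is not the authors' model: it assumes, purely for illustration, a two-component Gaussian mixture fitted by EM to log-magnitudes, followed by a basic magnitude subtraction with a spectral floor. The function names, the `floor` parameter, and the initialisation are all hypothetical.

```python
import numpy as np


def fit_two_component_em(x, n_iter=100):
    """Fit a 2-component 1-D Gaussian mixture to x with EM.

    Illustrative stand-in only: the Gaussian components are an assumption,
    not the distributions used in the paper.
    """
    x = np.asarray(x, dtype=float)
    # Initialise the low (noise) and high (speech) means from quantiles.
    mu = np.array([np.quantile(x, 0.25), np.quantile(x, 0.75)])
    var = np.full(2, x.var() + 1e-6)
    w = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # E-step: posterior probability of each component for every sample.
        log_p = (np.log(w) - 0.5 * np.log(2.0 * np.pi * var)
                 - 0.5 * (x[:, None] - mu) ** 2 / var)
        log_p -= log_p.max(axis=1, keepdims=True)
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means and variances.
        nk = resp.sum(axis=0) + 1e-12
        w = nk / x.size
        mu = (resp * x[:, None]).sum(axis=0) / nk
        var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk + 1e-6
    return w, mu, var, resp


def unsupervised_spectral_subtraction(mag, floor=0.01):
    """Subtract an unsupervised noise estimate from a magnitude spectrogram.

    mag: array of shape (n_frames, n_bins) with non-negative magnitudes.
    Returns the cleaned magnitudes and the per-cell noise posterior.
    """
    mag = np.asarray(mag, dtype=float)
    log_mag = np.log(mag + 1e-10).ravel()
    _, mu, _, resp = fit_two_component_em(log_mag)
    noise = int(np.argmin(mu))  # assume the lower-mean component is noise
    p_noise = resp[:, noise].reshape(mag.shape)
    # Per-bin noise magnitude: posterior-weighted average over frames.
    noise_mag = (p_noise * mag).sum(axis=0) / (p_noise.sum(axis=0) + 1e-12)
    # Subtract, keeping a small spectral floor to avoid negative magnitudes.
    cleaned = np.maximum(mag - noise_mag, floor * mag)
    return cleaned, p_noise
```

In such a sketch, the cleaned magnitudes would be passed to whatever feature extraction follows (for example MFCC or PLP computation), which is left out here.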

Details

Title Unsupervised Spectral Subtraction for Noise-Robust ASR

Author(s) Lathoud, Guillaume ; Magimai.-Doss, Mathew ; Mesot, Bertrand ; Bourlard, Hervé

Date 2005

Publisher Martigny, Switzerland, IDIAP

Keywords (free)

speech; lathoud; mathew; bmesot; bourlard

Note Published in Proceedings of the 2005 IEEE ASRU Workshop

Laboratories LIDIAP

The document appears in Scientific production and competences > STI - School of Engineering > IEM - Institute of Electrical and Micro Engineering > LIDIAP - IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical reports
Published

Record creation date 2006-03-10