A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR

Lathoud, Guillaume; Magimai.-Doss, Mathew; Mesot, Bertrand

doi:10.21437/Interspeech.2005-747

Lathoud, Guillaume; Magimai.-Doss, Mathew; Mesot, Bertrand

2005

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

This paper proposes a simple, computationally efficient 2-mixture model approach to discrimination between speech and background noise. It is directly derived from observations on real data, and can be used in a fully unsupervised manner, with the EM algorithm. A first application to sector-based, joint audio source localization and detection, using multiple microphones, confirms that the model can provide major enhancement. A second application to the single channel speech recognition task in a noisy environment yields major improvement on stationary noise and promising results on non-stationary noise.

Details

Title A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR

Author(s) Lathoud, Guillaume ; Magimai.-Doss, Mathew ; Mesot, Bertrand

Published in Proceedings of Interspeech 2005

Pages 2345-2348

Conference INTERSPEECH 2005

Date 2005

Publisher Lisbon, Portugal

Keywords

speech; lathoud; mathew; bmesot

Note IDIAP-RR 05-13

DOI https://doi.org/10.21437/Interspeech.2005-747

Additional link URL; Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2006-03-10

Actions

Preview

Select file: