A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR

Lathoud, Guillaume; Magimai.-Doss, Mathew; Mesot, Bertrand

doi:10.21437/Interspeech.2005-747

conference paper

2005

Proceedings of Interspeech 2005

INTERSPEECH 2005

This paper proposes a simple, computationally efficient two-component mixture model for discriminating between speech and background noise. The model is derived directly from observations on real data and can be trained in a fully unsupervised manner with the EM algorithm. In a first application, sector-based joint audio source localization and detection using multiple microphones, the model provides a major enhancement. In a second application, single-channel speech recognition in a noisy environment, it yields a major improvement on stationary noise and promising results on non-stationary noise.
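To make the idea of an unsupervised two-component mixture concrete, here is a minimal sketch of EM on one-dimensional values. It is not the authors' model: this record does not state which distributions the paper uses, so two Gaussians are swapped in purely for illustration, and the "log-energy" data is synthetic. The names `gaussian_pdf`, `speech_posterior`, and `fit_two_component_em` are hypothetical.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian (illustrative stand-in for the paper's components)."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def speech_posterior(x, weights, means, variances):
    """Posterior probability that x belongs to component 1 (the higher-mean one)."""
    p0 = weights[0] * gaussian_pdf(x, means[0], variances[0])
    p1 = weights[1] * gaussian_pdf(x, means[1], variances[1])
    total = p0 + p1
    return p1 / total if total > 0.0 else 0.5


def fit_two_component_em(values, n_iter=50):
    """Fit a two-component Gaussian mixture with EM, using no labels."""
    n = len(values)
    lo, hi = min(values), max(values)
    # Initialise the two means inside the data range, low component first.
    means = [lo + 0.25 * (hi - lo), lo + 0.75 * (hi - lo)]
    mean_all = sum(values) / n
    var_all = sum((v - mean_all) ** 2 for v in values) / n
    variances = [var_all, var_all]
    weights = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: responsibility of component 1 for every value.
        post = [speech_posterior(v, weights, means, variances) for v in values]
        # M-step: re-estimate weights, means, and variances.
        n1 = sum(post)
        n0 = n - n1
        means = [
            sum((1.0 - p) * v for p, v in zip(post, values)) / n0,
            sum(p * v for p, v in zip(post, values)) / n1,
        ]
        variances = [
            max(sum((1.0 - p) * (v - means[0]) ** 2 for p, v in zip(post, values)) / n0, 1e-6),
            max(sum(p * (v - means[1]) ** 2 for p, v in zip(post, values)) / n1, 1e-6),
        ]
        weights = [n0 / n, n1 / n]
    return weights, means, variances


# Synthetic data (an assumption, not from the paper): 700 low-energy "noise"
# values and 300 high-energy "speech" values, shuffled together.
random.seed(0)
data = [random.gauss(-5.0, 1.0) for _ in range(700)]
data += [random.gauss(3.0, 1.5) for _ in range(300)]
random.shuffle(data)

weights, means, variances = fit_two_component_em(data)
```

After fitting, `speech_posterior(x, weights, means, variances)` gives a soft speech/noise decision for a new value `x`; thresholding it at 0.5 would give a hard decision.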

Name: lathoud05c.pdf

Access type: openaccess

Size: 3.91 MB

Format: Adobe PDF

Checksum (MD5): 888dc9f1379bebdfc0f97683d16b4d19