Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels

Lathoud, Guillaume; Magimai.-Doss, Mathew; Bourlard, Hervé

Lathoud, Guillaume; Magimai.-Doss, Mathew; Bourlard, Hervé

2006

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

This paper addresses several issues of classical spectral subtraction methods with respect to the automatic speech recognition task in noisy environments. The main contributions of this paper are twofold. First, a channel normalization method is proposed to extend spectral subtraction to the case of transmission channels such as cellphones. It equalizes the transmission channel and removes part of the additive noise. Second, a simple, computationally efficient \mbox{2-component} probabilistic model is proposed to discriminate between speech and additive noise at the magnitude spectrogram level. Based on this model, an alternative to classical spectral subtraction is proposed, called ``Unsupervised Spectral Subtraction''~(USS). The main difference is that the proposed approach does not require any parameter tuning. Experimental studies on Aurora~2 show that channel normalization followed by USS compares advantageously to both classical spectral subtraction, and the ETSI standard front-end (Wiener filtering). Compared to the ETSI standard front-end, a 21.3\%~relative improvement is obtained on 0 to 20~dB noise conditions, for an absolute loss of 0.1~\% in clean conditions. The computational cost of the proposed approach is very low, which makes it fit for real-time applications.

Details

Title Unsupervised Spectral Subtraction for Noise-Robust ASR on Unknown Transmission Channels

Author(s) Lathoud, Guillaume ; Magimai.-Doss, Mathew ; Bourlard, Hervé

Date 2006

Publisher Martigny, Switzerland, IDIAP

Keywords

speech; lathoud; mathew; bourlard

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2006-03-10

Actions

Preview

Select file: