Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction

Thomas, Samuel; Ganapathy, Sriram; Hermansky, Hynek

Thomas, Samuel; Ganapathy, Sriram; Hermansky, Hynek

2008

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Performance of a typical automatic speech recognition (ASR) system severely degrades when it encounters speech from reverberant environments. Part of the reason for this degradation is the feature extraction techniques that use analysis windows which are much shorter than typical room impulse responses. We present a feature extraction technique based on modeling temporal envelopes of the speech signal in narrow sub-bands using Frequency Domain Linear Prediction (FDLP). FDLP provides an all-pole approximation of the Hilbert envelope of the signal obtained by linear prediction on cosine transform of the signal. ASR experiments on speech data degraded with a number of room impulse responses (with varying degrees of distortion) show significant performance improvements for the proposed FDLP features when compared to other robust feature extraction techniques (average relative reduction of $24 \%$ in word error rate). Similar improvements are also obtained for far-field data which contain natural reverberation in background noise. These results are achieved without any noticeable degradation in performance for clean speech.

Details

Title Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction

Author(s) Thomas, Samuel ; Ganapathy, Sriram ; Hermansky, Hynek

Date 2008

Publisher IDIAP

Note To appear in IEEE Signal Processing Letters 2008

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2010-02-11

Actions

Preview

Select file: