Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition.

Garner, Philip N.

2011

Abstract

Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. It is argued that such normalisation leads naturally to a speech feature based on signal-to-noise ratio (SNR) rather than absolute energy (or power). Explicit calculation of this SNR-cepstrum by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy-based) cepstrum. The SNR-cepstrum is shown to be almost identical to the articulation index known in psycho-acoustics. Combination of the SNR-cepstrum with the well-known perceptual linear prediction method is shown to be beneficial in noisy environments.
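
The contrast between an energy-based cepstrum and an SNR-based one can be illustrated with a minimal sketch. The abstract does not give the report's formulas, so everything below is an assumption made for illustration: the per-bin SNR estimate max(power / noise - 1, 0), the log(1 + SNR) compression, the unnormalised DCT-II, and the function names snr_spectrum, dct_ii, energy_cepstrum and snr_cepstrum are all hypothetical and should not be read as the author's implementation.

```python
import math


def snr_spectrum(power, noise):
    """Per-bin SNR estimate, floored at zero (an assumed estimator).

    power: observed power per frequency bin (speech plus noise)
    noise: noise power estimate per bin, assumed strictly positive
    """
    return [max(p / n - 1.0, 0.0) for p, n in zip(power, noise)]


def dct_ii(values, n_coeffs):
    """Unnormalised DCT-II, returning the first n_coeffs coefficients."""
    size = len(values)
    return [
        sum(v * math.cos(math.pi * k * (i + 0.5) / size)
            for i, v in enumerate(values))
        for k in range(n_coeffs)
    ]


def energy_cepstrum(power, n_coeffs):
    """Conventional cepstrum: DCT of the log power spectrum."""
    return dct_ii([math.log(p) for p in power], n_coeffs)


def snr_cepstrum(power, noise, n_coeffs):
    """Illustrative SNR-based cepstrum: DCT of log(1 + SNR) per bin."""
    return dct_ii(
        [math.log(1.0 + s) for s in snr_spectrum(power, noise)],
        n_coeffs,
    )


if __name__ == "__main__":
    power = [4.0, 9.0, 2.0, 16.0]   # made-up example values
    noise = [1.0, 3.0, 2.0, 4.0]    # made-up noise estimate
    gain = 10.0
    louder = [gain * p for p in power]
    louder_noise = [gain * n for n in noise]

    # A gain applied to the whole signal shifts the energy cepstrum ...
    print(energy_cepstrum(power, 3))
    print(energy_cepstrum(louder, 3))
    # ... but leaves the SNR-based feature unchanged.
    print(snr_cepstrum(power, noise, 3))
    print(snr_cepstrum(louder, louder_noise, 3))
```

Under these assumptions, a gain applied to the whole recording cancels in the ratio power / noise, so the SNR-based feature needs no mean subtraction to remove it, and a frame containing only noise maps to an all-zero feature vector. Whether this matches the report's formulation in detail would need to be checked against the PDF.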

Details

Publisher Idiap

Keywords

aurora; Automatic Speech Recognition; cepstral normalisation; Noise Robustness

Laboratories LIDIAP

Record Appears in

Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2011-07-06
