Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition

Garner, Philip N.

doi:10.1016/j.specom.2011.05.007

Garner, Philip N.

2011

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. In this paper, it is argued that such normalisation leads naturally to a speech feature based on signal to noise ratio rather than absolute energy (or power). Explicit calculation of this SNR-cepstrum by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy based) cepstrum. The relationship between the SNR-cepstrum and the articulation index, known in psycho-acoustics, is discussed. Experiments are presented suggesting that the combination of the SNR-cepstrum with the well known perceptual linear prediction method can be beneficial in noisy environments.

Details

Title Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition

Author(s) Garner, Philip N.

Published in Speech Communication

Volume 53

Issue 8

Pages 991-1001

Date 2011

DOI https://doi.org/10.1016/j.specom.2011.05.007

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2011-07-06

Actions

Preview

Select file: