Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition

Garner, Philip N.

doi:10.1016/j.specom.2011.05.007

research article

2011

Speech Communication

Cepstral normalisation in automatic speech recognition is investigated in the context of robustness to additive noise. In this paper, it is argued that such normalisation leads naturally to a speech feature based on signal-to-noise ratio (SNR) rather than absolute energy (or power). Explicit calculation of this SNR-cepstrum by means of a noise estimate is shown to have theoretical and practical advantages over the usual (energy-based) cepstrum. The relationship between the SNR-cepstrum and the articulation index, known in psycho-acoustics, is discussed. Experiments are presented suggesting that the combination of the SNR-cepstrum with the well-known perceptual linear prediction method can be beneficial in noisy environments.
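
As an illustration of the distinction drawn in the abstract, the following is a minimal sketch of how an SNR-based cepstrum might differ from an energy-based one. It is not the paper's implementation. The function names (`dct_ii`, `energy_cepstrum`, `snr_cepstrum`), the SNR estimate `max(power / noise - 1, 0)`, the `log(1 + SNR)` compression and the flooring constant are all assumptions made for this example; the abstract states only that the feature is computed from a noise estimate.

```python
import numpy as np


def dct_ii(x, n_coeffs):
    """Unnormalised DCT-II along the last axis, keeping the first n_coeffs terms."""
    n = x.shape[-1]
    k = np.arange(n_coeffs)[:, None]           # cepstral index
    m = np.arange(n)[None, :]                  # filter-bank bin index
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))  # shape (n_coeffs, n)
    return x @ basis.T


def energy_cepstrum(power, n_coeffs=13, floor=1e-10):
    """Usual energy-based cepstrum: DCT of the log power spectrum."""
    return dct_ii(np.log(np.maximum(power, floor)), n_coeffs)


def snr_cepstrum(power, noise, n_coeffs=13, floor=1e-10):
    """Hypothetical SNR-based cepstrum: DCT of log(1 + estimated SNR).

    The SNR estimate and the log(1 + SNR) form are assumptions of this sketch.
    """
    snr = np.maximum(power / np.maximum(noise, floor) - 1.0, 0.0)
    return dct_ii(np.log1p(snr), n_coeffs)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noise = rng.uniform(0.5, 1.5, size=24)                   # per-bin noise estimate
    power = noise + rng.uniform(0.0, 10.0, size=(5, 24))     # 5 frames, 24 bins

    # A gain common to the signal and the noise estimate cancels in the ratio,
    # so the SNR-based feature is unchanged while the energy-based one shifts.
    print(np.allclose(snr_cepstrum(power, noise),
                      snr_cepstrum(3.0 * power, 3.0 * noise)))
    print(np.allclose(energy_cepstrum(power), energy_cepstrum(3.0 * power)))
```

In this sketch a frame whose power equals the noise estimate maps to an all-zero feature vector, which is one way of seeing why a ratio-based feature needs no separate normalisation of absolute level.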

Name: Garner_SPECOM_2011.pdf

Access type: openaccess

Size: 279.75 KB

Format: Adobe PDF

Checksum (MD5): 7d7fc38ede6337651c1f341304924310