report
Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics
2007
We propose an alternative means of training a multilayer perceptron for the task of speech activity detection based on a criterion to minimise the error in the estimation of mean and variance statistics for speech cepstrum based features using the Kullback-Leibler divergence. We present our baseline and proposed speech activity detection approaches for multi-channel meeting room recordings and demonstrate the effectiveness of the new criterion by comparing the two approaches when used to carry out cepstrum mean and variance normalisation of features used in our meeting ASR system.
Type
report
Author(s)
Vepa, Jithendra
Date Issued
2007
Publisher
IDIAP
Written at
EPFL
EPFL units
Available on Infoscience
February 11, 2010
Use this identifier to reference this record