An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features

Aradilla, Guillermo; Vepa, Jithendra; Bourlard, Hervé

doi:10.1109/ICASSP.2007.366998

conference paper

An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features

Aradilla, Guillermo

•

Vepa, Jithendra

•

Bourlard, Hervé

2007

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07

This paper investigates the use of features based on posterior probabilities of subword units such as phonemes. These features are typically transformed when used as inputs for a hidden Markov model with mixture of Gaussians as emission distribution (HMM/GMM). In this work, we introduce a novel acoustic model that avoids the Gaussian assumption and directly uses posterior features without any transformation. This model is described by a finite state machine where each state is characterized by a target distribution and the cost function associated to each state is given by the Kullback-Leibler (KL) divergence between its target distribution and the posterior features. Furthermore, hybrid HMM/ANN system can be seen as a particular case of this KL-based model where state target distributions are predefined. A training method is also presented that minimizes the KL-divergence between the state target distributions and the posteriors features.

Name

aradilla-icassp-2007.pdf

Access type

openaccess

Size

118.35 KB

Format

Adobe PDF

Checksum (MD5)

11dff62e2cc79cc3acf56be384a6bdae