An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features

Aradilla, Guillermo; Vepa, Jithendra; Bourlard, Hervé

report

Aradilla, Guillermo

•

Vepa, Jithendra

•

Bourlard, Hervé

2006

This paper investigates the use of features based on posterior probabilities of subword units such as phonemes. These features are typically transformed when used as inputs for a hidden Markov model with mixture of Gaussians as emission distribution (HMM/GMM). In this work, we introduce a novel acoustic model that avoids the Gaussian assumption and directly uses posterior features without any transformation. This model is described by a finite state machine where each state is characterized by a target distribution and the cost function associated to each state is given by the Kullback-Leibler (KL) divergence between its target distribution and the posterior features. Furthermore, hybrid HMM/ANN system can be seen as a particular case of this KL-based model where state target distributions are predefined. A training method is also presented that minimizes the KL-divergence between the state target distributions and the posteriors features.

Name

aradilla-idiap-rr-06-60.pdf

Access type

openaccess

Size

118.35 KB

Format

Adobe PDF

Checksum (MD5)

a1392e85b6cf9552eeda201e34c8b512