The subspace Gaussian mixture model—A structured model for speech recognition

Povey, Daniel; Burget, Lukáš; Agarwal, Mohit; Akyazi, Pinar; Kai, Feng; Ghoshal, Arnab; Glembek, Ondřej

doi:10.1016/j.csl.2010.06.003

research article

The subspace Gaussian mixture model—A structured model for speech recognition

Povey, Daniel

•

Burget, Lukáš

•

Agarwal, Mohit

more

2011

Computer Speech & Language

We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states share the same Gaussian Mixture Model (GMM) structure with the same number of Gaussians in each state. The model is defined by vectors associated with each state with a dimension of, say, 50, together with a global mapping from this vector space to the space of parameters of the GMM. This model appears to give better results than a conventional model, and the extra structure offers many new opportunities for modeling innovations while maintaining compatibility with most standard techniques.

Name

1-s2.0-S088523081000063X-main.pdf

Access type

openaccess

Size

483.1 KB

Format

Adobe PDF

Checksum (MD5)

544c2a645f3ec1ed9e26a39a4d5f4ccd