The subspace Gaussian mixture model—A structured model for speech recognition

Glembek, Ondřej

doi:10.1016/j.csl.2010.06.003

research article

The subspace Gaussian mixture model—A structured model for speech recognition

Povey, Daniel

•

Burget, Lukáš

•

Agarwal, Mohit

2011

Computer Speech & Language

We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states share the same Gaussian Mixture Model (GMM) structure with the same number of Gaussians in each state. The model is defined by vectors associated with each state with a dimension of, say, 50, together with a global mapping from this vector space to the space of parameters of the GMM. This model appears to give better results than a conventional model, and the extra structure offers many new opportunities for modeling innovations while maintaining compatibility with most standard techniques.

Type

research article

DOI

10.1016/j.csl.2010.06.003

Author(s)

Povey, Daniel

Burget, Lukáš

Agarwal, Mohit

Akyazi, Pinar

Kai, Feng

Ghoshal, Arnab

Glembek, Ondřej

Date Issued

2011

Publisher

Elsevier

Published in

Computer Speech & Language

Volume

25

Issue

2

Start page

404

End page

439

Subjects

Speech recognition

•

Gaussian Mixture Model

•

Subspace Gaussian Mixture Model

Editorial or Peer reviewed

REVIEWED

Written at

OTHER

EPFL units

IEL

Available on Infoscience

November 19, 2014

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/108945