The Kaldi Speech Recognition Toolkit

Vesely, Karel

Povey, Daniel

•

Ghoshal, Arnab

•

Boulianne, Gilles

2012

We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state automata (using the freely available OpenFst), together with detailed documentation and a comprehensive set of scripts for building complete recognition systems. Kaldi is written in C++, and the core library supports modeling of arbitrary phonetic-context sizes, acoustic modeling with subspace Gaussian mixture models (SGMM) as well as standard Gaussian mixture models, together with all commonly used linear and affine transforms. Kaldi is released under the Apache License v2.0, which is highly nonrestrictive, making it suitable for a wide community of users.

Type

report

Author(s)

Povey, Daniel

•

Ghoshal, Arnab

•

Boulianne, Gilles

•

Burget, Lukas

•

Glembek, Ondrej

•

Goel, Nagendra

•

Hannemann, Mirko

•

Motlicek, Petr

•

Qian, Yanmin

•

Schwarz, Petr

Date Issued

2012

Publisher

Idiap

Subjects

ASR

•

Automatic Speech Recognition

•

GMM

•

HTK

•

SGMM

Written at

EPFL

EPFL units

LIDIAP

Available on Infoscience

December 19, 2013

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/98595