The Kaldi Speech Recognition Toolkit

Povey, Daniel; Ghoshal, Arnab; Boulianne, Gilles; Burget, Lukas; Glembek, Ondrej; Goel, Nagendra; Hannemann, Mirko; Motlicek, Petr; Qian, Yanmin; Schwarz, Petr; Silovsky, Jan; Stemmer, Georg; Vesely, Karel

conference paper not in proceedings

2011

IEEE 2011 Workshop on Automatic Speech Recognition and Understanding

We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state transducers (using the freely available OpenFst), together with detailed documentation and scripts for building complete recognition systems. Kaldi is written in C++, and the core library supports modeling of arbitrary phonetic-context sizes, acoustic modeling with subspace Gaussian mixture models (SGMM) as well as standard Gaussian mixture models, together with all commonly used linear and affine transforms. Kaldi is released under the Apache License v2.0, which is highly nonrestrictive, making it suitable for a wide community of users.

Name: Povey_ASRU2011_2011.pdf
Access type: openaccess
Size: 132.94 KB
Format: Adobe PDF
Checksum (MD5): 54649f0f00582bed1503f21c3818e0a6