Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition

Pinto, Joel Praveen; Hermansky, Hynek

doi:10.21437/Interspeech.2008-132

conference paper

Combining Evidence from a Generative and a Discriminative Model in Phoneme Recognition

Pinto, Joel Praveen

•

Hermansky, Hynek

2008

Proceedings of Interspeech

We investigate the use of the log-likelihood of the features obtained from a generative Gaussian mixture model, and the posterior probability of phonemes from a discriminative multilayered perceptron in multi-stream combination for recognition of phonemes. Multi-stream combination techniques, namely early integration and late integration are used to combine the evidence from these models. By using multi-stream combination, we obtain a phoneme recognition accuracy of 74% on the standard TIMIT database, an absolute improvement of 2.5% over the single best stream.

Name

pinto-IS-2008.pdf

Access type

openaccess

Size

61.34 KB

Format

Adobe PDF

Checksum (MD5)

79c07249782480f48cd523b3de8a97c8