Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems

Bernardis, Giulia; Bourlard, Hervé

doi:10.21437/ICSLP.1998-420

conference paper

Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems

Bernardis, Giulia

•

Bourlard, Hervé

1998

Proceedings of International Conference on Spoken Language Processing (ICSLP'98)

In this paper we define and investigate a set of confidence measures based on hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) acoustic models. All these measures are using the neural network to estimate the local phone posterior probabilities, which are then combined and normalized in different ways. Experimental results will indeed show that the use of an appropriate duration normalization is very important to obtain good estimates of the phone and word confidences. The different measures are evaluated at the phone and word levels on both an isolated word task (PHONEBOOK) and a continuous speech recognition task (BREF). It will be shown that one of those confidence measures is well suited for utterance verification, and that (as one could expect) confidence measures at the word level perform better than those at the phone level. Finally, using the resulting approach on PHONEBOOK to rescore the N-best list is shown to yield a 34% decrease in word error rate.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/227795

Name

98-11.pdf

Access type

openaccess

Size

77.09 KB

Format

Adobe PDF

Checksum (MD5)

8397c3de370c95851bcedd33469f70a0