Using Comparison of Parallel Phoneme Probability streams for OOV Word Detection

Tosic, Tamara; Magimai-Doss, Mathew; Hermansky, Hynek

Tosic, Tamara; Magimai-Doss, Mathew; Hermansky, Hynek

2008

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

In this paper, we investigate the approach of comparing two different parallel streams of phoneme posterior probability estimates for OOV word detection. The ﬁrst phoneme posterior probability stream is estimated using only the knowledge of short-term acoustic observation. In our work we refer this stream as ”out-of-context posteriors”. The second posterior probability stream, referred also as ”in-context posteriors” is estimated using the knowledge of the whole acoustic observation sequence: the acoustic model and the language model of an ASR system. In particular, we focus our study on different types of distance measures, namely KL-divergence and Euclidean distance, to compare the two phoneme posterior probability streams. Our experiments on large vocabulary automatic speech recognition task shows that using KL-divergence measure estimated with the in-context posteriors as reference distribution consistently yields a better OOV word detection system.

Details

Title Using Comparison of Parallel Phoneme Probability streams for OOV Word Detection

Author(s) Tosic, Tamara ; Magimai-Doss, Mathew ; Hermansky, Hynek

Conference EUSIPCO, Lausanne, Switzerland, August 2008.

Date 2008

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2011-11-09

Abstract

Details

Actions