Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting

Pinto, Joel Praveen; Bourlard, Hervé; Greve, Zacharie De; Hermansky, Hynek

Pinto, Joel Praveen; Bourlard, Hervé; Greve, Zacharie De; Hermansky, Hynek

2007

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In this paper, we further investigate the large vocabulary continuous speech recognition approach to keyword spotting. Given a speech utterance, recognition is performed to obtain a word lattice. The posterior probability of keyword hypotheses in the lattice is computed and used to derive a confidence measure to accept/reject the keyword. We extend this framework and replace the acoustic likelihoods in the lattice obtained from a Gaussian mixture model (GMM) with likelihoods derived from a multilayered perceptron (MLP). We compare the two rescoring techniques on the conversational telephone speech database distributed by NIST for the spoken term detection evaluation. Experimental results show that GMM lattices still perform better than the rescored lattices for short and medium length keywords, but on longer keywords, the MLP rescored word lattices perform slightly better.

Details

Title Comparing Different Word Lattice Rescoring Approaches Towards Keyword Spotting

Author(s) Pinto, Joel Praveen ; Bourlard, Hervé ; Greve, Zacharie De ; Hermansky, Hynek

Date 2007

Publisher IDIAP

Note Submitted for publication

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2010-02-11

Actions

Preview

Select file: