Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition

Imseng, David; Rasipuram, Ramya; Magimai.-Doss, Mathew

2012

Abstract

One of the main challenges in non-native speech recognition is how to handle the acoustic variability present in multi-accented non-native speech with a limited amount of training data. In this paper, we investigate an approach that addresses this challenge by using Kullback-Leibler divergence based hidden Markov models (KL-HMM). More precisely, the acoustic variability in the multi-accented speech is handled by using multilingual phoneme posterior probabilities, estimated by a multilayer perceptron trained on auxiliary data, as input features for the KL-HMM system. With limited training data, we then build better acoustic models by exploiting the fact that the KL-HMM system has fewer parameters. On the HIWIRE corpus, the proposed approach yields a word error rate (WER) of 1.9% with 149 minutes of training data and a WER of 5.5% with 2 minutes of training data.
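
As a rough illustration of the idea in the abstract, the sketch below scores one frame of phoneme posterior probabilities against HMM states that are each described by a single categorical distribution, using the Kullback-Leibler divergence as the local score. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the three-class toy distributions, the smoothing constant `eps`, and the direction of the divergence (state distribution first, frame second) are all chosen here for illustration only.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """Return KL(p || q) for two discrete distributions of equal length.

    eps is an illustrative smoothing constant (an assumption) that avoids
    taking the logarithm of zero.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def best_state(posterior, states):
    """Score one posterior frame against every state and pick the closest.

    `states` maps a hypothetical state name to its categorical distribution.
    The divergence direction used here is an assumption for this sketch.
    """
    scores = {name: kl_divergence(dist, posterior) for name, dist in states.items()}
    return min(scores, key=scores.get), scores


# Toy example: three phoneme classes, two states (all values are made up).
states = {
    "state_a": [0.8, 0.1, 0.1],
    "state_b": [0.1, 0.8, 0.1],
}
frame = [0.7, 0.2, 0.1]  # stands in for one frame of MLP posterior output

best, scores = best_state(frame, states)
print(best, scores)
```

In this sketch each state stores only one vector whose length equals the number of posterior classes, which gives an intuition for why such a model can have few parameters to estimate when training data is scarce.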

Details

Title Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition

Author(s) Imseng, David ; Rasipuram, Ramya ; Magimai.-Doss, Mathew

Date 2012

Publisher Idiap

Keywords

Hidden Markov Model; Kullback-Leibler divergence; multilayer perceptron; Posterior features

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports

Record creation date 2013-12-19
