Using out-of-language data to improve an under-resourced speech recognizer

Imseng, David; Motlicek, Petr; Bourlard, Hervé; Garner, Philip N.

doi:10.1016/j.specom.2013.01.007

2014

Abstract

Under-resourced speech recognizers may benefit from data in languages other than the target language. In this paper, we report how to boost the performance of an Afrikaans automatic speech recognition system by using already available Dutch data. We successfully exploit available multilingual resources through (1) posterior features estimated by multilayer perceptrons (MLPs) and (2) subspace Gaussian mixture models (SGMMs). Both the MLPs and the SGMMs can be trained on out-of-language data. We use three different acoustic modeling techniques, namely Tandem, Kullback-Leibler divergence based HMMs (KL-HMMs), and SGMMs, and show that the proposed multilingual systems yield a 12% relative improvement compared to a conventional monolingual HMM/GMM system trained only on Afrikaans. We also show that KL-HMMs are extremely powerful for under-resourced languages: using only six minutes of Afrikaans data (in combination with out-of-language data), KL-HMM yields about 30% relative improvement compared to conventional maximum likelihood linear regression and maximum a posteriori based acoustic model adaptation.

Details

Title: Using out-of-language data to improve an under-resourced speech recognizer

Author(s): Imseng, David; Motlicek, Petr; Bourlard, Hervé; Garner, Philip N.

Published in: Speech Communication

Volume: 56

Pages: 142-151

Date: 2014

ISSN: 0167-6393

Keywords: Multilingual speech recognition; Posterior features; Subspace Gaussian mixture models; Under-resourced languages; Afrikaans

DOI: https://doi.org/10.1016/j.specom.2013.01.007

Laboratories: LIDIAP

Record creation date: 2013-12-19
