Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans

Imseng, David; Bourlard, Hervé; Garner, Philip N.

Imseng, David; Bourlard, Hervé; Garner, Philip N.

2012

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Under-resourced speech recognizers may benefit from data in languages other than the target language. In this paper, we boost the performance of an Afrikaans speech recognizer by using already available data from other languages. To successfully exploit available multilingual resources, we use posterior features, estimated by multilayer perceptrons that are trained on similar languages. For two different acoustic modeling techniques, Tandem and Kullback-Leibler divergence based HMMs, the proposed multilingual system yields more than 10% relative improvement compared to the corresponding monolingual systems only trained on Afrikaans.

Details

Title Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans

Author(s) Imseng, David ; Bourlard, Hervé ; Garner, Philip N.

Date 2012

Publisher Idiap

Additional link Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports

Record creation date 2013-12-19

Files

Abstract

Details

PDF