Using KL-divergence and multilingual information to improve ASR for under-resourced languages

Imseng, David; Bourlard, Hervé; Garner, Philip N.

doi:10.1109/ICASSP.2012.6289010

Imseng, David; Bourlard, Hervé; Garner, Philip N.

2012

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Setting out from the point of view that automatic speech recognition (ASR) ought to benefit from data in languages other than the target language, we propose a novel Kullback-Leibler (KL) divergence based method that is able to exploit multilingual information in the form of universal phoneme posterior probabilities conditioned on the acoustics. We formulate a means to train a recognizer on several different languages, and subsequently recognize speech in a target language for which only a small amount of data is available. Taking the Greek SpeechDat(II) data as an example, we show that the proposed formulation is sound, and show that it is able to outperform a current state-of-the-art HMM/GMM system. We also use a hybrid Tandem-like system to further understand the source of the benefit.

Details

Title Using KL-divergence and multilingual information to improve ASR for under-resourced languages

Author(s) Imseng, David ; Bourlard, Hervé ; Garner, Philip N.

Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pages 4869-4872

Conference IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto

Date 2012

DOI https://doi.org/10.1109/ICASSP.2012.6289010

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL

Record creation date 2013-12-19

Files

Abstract

Details

PDF