Improving non-native ASR through stochastic multilingual phoneme space transformations

Imseng, David; Bourlard, Hervé; Dines, John; Garner, Philip N.; Magimai.-Doss, Mathew

doi:10.21437/Interspeech.2011-225

Imseng, David; Bourlard, Hervé; Dines, John; Garner, Philip N.; Magimai.-Doss, Mathew

2011

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We propose a stochastic phoneme space transformation technique that allows the conversion of conditional source phoneme posterior probabilities (conditioned on the acoustics) into target phoneme posterior probabilities. The source and target phonemes can be in any language and phoneme format such as the International Phonetic Alphabet. The novel technique makes use of a Kullback-Leibler divergence based hidden Markov model and can be applied to non-native and accented speech recognition or used to adapt systems to underresourced languages. In this paper, and in the context of hybrid HMM/MLP recognizers, we successfully apply the proposed approach to non-native English speech recognition on the HIWIRE dataset.

Details

Title Improving non-native ASR through stochastic multilingual phoneme space transformations

Author(s) Imseng, David ; Bourlard, Hervé ; Dines, John ; Garner, Philip N. ; Magimai.-Doss, Mathew

Published in Interspeech 2011

Pages 537-540

Conference Interspeech, Florence, Italy

Date 2011

DOI https://doi.org/10.21437/Interspeech.2011-225

Additional link Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2011-07-06

Files

Abstract

Details

PDF