Language dependent universal phoneme posterior estimation for mixed language speech recognition

Imseng, David; Bourlard, Hervé; Magimai.-Doss, Mathew; Dines, John

Imseng, David; Bourlard, Hervé; Magimai.-Doss, Mathew; Dines, John

2011

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

This paper presents a new approach to estimate "universal" phoneme posterior probabilities for mixed language speech recognition. More specifically, we propose a new theoretical framework to combine phoneme class posterior probabilities in a principled way by using (statistical) evidence about the language identity. We investigate the proposed approach in a mixed language environment (SpeechDat(II)) consisting of five European languages. Our studies show that the proposed approach can yield significant improvements on a mixed language task, while maintaining the performance on monolingual tasks. Additionally, through a case study, we also demonstrate the potential benefits of the proposed approach for non-native speech recognition.

Details

Title Language dependent universal phoneme posterior estimation for mixed language speech recognition

Author(s) Imseng, David ; Bourlard, Hervé ; Magimai.-Doss, Mathew ; Dines, John

Date 2011

Publisher Idiap

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2011-07-06