Towards mixed language speech recognition systems

Imseng, David; Bourlard, Hervé; Magimai.-Doss, Mathew

doi:10.21437/Interspeech.2010-110

2010

Abstract

Multilingual speech recognition involves numerous research challenges, including common phoneme sets, adaptation with limited amounts of training data, and mixed language recognition (common in many countries, such as Switzerland). In the latter case, one cannot even assume that the spoken language is known in advance. This is the context and motivation of the present work. We investigate how current state-of-the-art speech recognition systems can be exploited in multilingual environments where the language (one of an assumed set of five possible languages, in our case) is not known a priori during recognition. We combine monolingual systems and extensively develop and compare different features and acoustic models. On SpeechDat(II) datasets, and in the context of isolated words, we show that it is possible to approach the performance of monolingual systems even if the identity of the spoken language is not known a priori.

Details

Title Towards mixed language speech recognition systems

Author(s) Imseng, David ; Bourlard, Hervé ; Magimai.-Doss, Mathew

Published in Interspeech 2010

Pages 278-281

Conference Interspeech, Makuhari, Japan

Date 2010

DOI https://doi.org/10.21437/Interspeech.2010-110

Laboratories LIDIAP

Record creation date 2010-08-26
