Current trends in multilingual speech processing

Bourlard, Hervé; Dines, John; Magimai.-Doss, Mathew; Garner, Philip N.; Imseng, David; Motlicek, Petr; Liang, Hui; Saheer, Lakshmi; Valente, Fabio

doi:10.1007/s12046-011-0050-4

Bourlard, Hervé; Dines, John; Magimai.-Doss, Mathew; Garner, Philip N.; Imseng, David; Motlicek, Petr; Liang, Hui; Saheer, Lakshmi; Valente, Fabio

2011

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In this paper, we describe recent work at Idiap Research Institute in the domain of multilingual speech processing and provide some insights into emerging challenges for the research community. Multilingual speech processing has been a topic of ongoing interest to the research community for many years and the field is now receiving renewed interest owing to two strong driving forces. Firstly, technical advances in speech recognition and synthesis are posing new challenges and opportunities to researchers. For example, discriminative features are seeing wide application by the speech recognition community, but additional issues arise when using such features in a multilingual setting. Another example is the apparent convergence of speech recognition and speech synthesis technologies in the form of statistical parametric methodologies. This convergence enables the investigation of new approaches to unified modelling for automatic speech recognition and text-to-speech synthesis (TTS) as well as cross-lingual speaker adaptation for TTS. The second driving force is the impetus being provided by both government and industry for technologies to help break down domestic and international language barriers, these also being barriers to the expansion of policy and commerce. Speech-to-speech and speech-to-text translation are thus emerging as key technologies at the heart of which lies multilingual speech processing.

Details

Title Current trends in multilingual speech processing

Author(s) Bourlard, Hervé ; Dines, John ; Magimai.-Doss, Mathew ; Garner, Philip N. ; Imseng, David ; Motlicek, Petr ; Liang, Hui ; Saheer, Lakshmi ; Valente, Fabio

Published in Sadhana-Academy Proceedings In Engineering Sciences

Volume 36

Issue 5

Pages 885-915

Date 2011

Keywords

Multilingual speech processing; speech synthesis; speech recognition; speech-to-speech translation; language identification; Automatic Language Identification; Recognition; Hmm

DOI https://doi.org/10.1007/s12046-011-0050-4

Other identifier(s) View record in Web of Science

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2012-06-25

Actions

Preview

Select file: