Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM

Rasipuram, Ramya; Magimai.-Doss, Mathew

doi:10.1109/ICASSP.2012.6289003

2012

Abstract

This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach in which the probabilistic relation between graphemes and phonemes is first captured from acoustic data using a Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework, the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main applications of the proposed G2P approach is in automatic speech recognition or text-to-speech systems built with limited linguistic resources. We demonstrate this potential through a simulation study in which linguistic resources from one domain are used to create linguistic resources for a different domain.
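As a rough illustration of the two ingredients the abstract names, the sketch below computes a Kullback-Leibler divergence between categorical distributions and reads a phoneme sequence off a word's graphemes. It is a toy under stated assumptions, not the paper's system: the phoneme inventory, the per-grapheme distributions, and the names `PHONEMES`, `GRAPHEME_STATES`, `kl_divergence`, and `naive_g2p` are all hypothetical, and the per-grapheme argmax merely stands in for the decoding framework, which this record does not describe.

```python
import math

# Hypothetical phoneme inventory, for illustration only.
PHONEMES = ["k", "ae", "t", "s"]

# Hypothetical categorical distributions over PHONEMES, one per grapheme.
# In a KL-HMM such distributions would be estimated from acoustic data;
# the values here are invented.
GRAPHEME_STATES = {
    "c": [0.70, 0.05, 0.05, 0.20],
    "a": [0.05, 0.85, 0.05, 0.05],
    "t": [0.05, 0.05, 0.85, 0.05],
}


def kl_divergence(p, q, eps=1e-12):
    """Return KL(p || q) for two categorical distributions of equal length."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def naive_g2p(word):
    """Pick the most probable phoneme per grapheme, collapsing adjacent repeats.

    This is a simplification assumed for the example, not the paper's decoder.
    """
    phones = []
    for grapheme in word:
        dist = GRAPHEME_STATES[grapheme]
        best = PHONEMES[max(range(len(dist)), key=dist.__getitem__)]
        if not phones or phones[-1] != best:
            phones.append(best)
    return phones


if __name__ == "__main__":
    print(naive_g2p("cat"))
    print(kl_divergence(GRAPHEME_STATES["c"], GRAPHEME_STATES["a"]))
```

The divergence is zero when the two distributions are identical and grows as they disagree, which is what makes it usable as a local matching score between a state's distribution and an observed phoneme posterior vector.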

Details

Title: Acoustic Data-driven Grapheme-to-Phoneme Conversion using KL-HMM

Author(s): Rasipuram, Ramya; Magimai.-Doss, Mathew

Published in: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pages: 4841-4844

Conference: IEEE International Conference on Acoustics, Speech and Signal Processing

Date: 2012

DOI: https://doi.org/10.1109/ICASSP.2012.6289003

Laboratories: LIDIAP

Record creation date: 2013-12-19
