Articulatory feature based continuous speech recognition using probabilistic lexical modeling

Rasipuram, Ramya; Magimai.-Doss, Mathew

doi:10.1016/j.csl.2015.04.003

Rasipuram, Ramya; Magimai.-Doss, Mathew

2016

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Phonological studies suggest that the typical subword units such as phones or phonemes used in automatic speech recognition systems can be decomposed into a set of features based on the articulators used to produce the sound. Most of the current approaches to integrate articulatory feature (AF) representations into an automatic speech recognition (ASR) system are based on a deterministic knowledge-based phoneme-to-AF relationship. In this paper, we propose a novel two stage approach in the framework of probabilistic lexical modeling to integrate AF representations into an ASR system. In the first stage, the relationship between acoustic feature observations and various AFs is modeled. In the second stage, a probabilistic relationship between subword units and AFs is learned using transcribed speech data. Our studies on a continuous speech recognition task show that the proposed approach effectively integrates AFs into an ASR system. Furthermore, the studies show that either phonemes or graphemes can be used as subword units. Analysis of the probabilistic relationship captured by the parameters has shown that the approach is capable of adapting the knowledge-based phoneme-to-AF representations using speech data; and allows different AFs to evolve asynchronously.

Details

Title Articulatory feature based continuous speech recognition using probabilistic lexical modeling

Author(s) Rasipuram, Ramya ; Magimai.-Doss, Mathew

Published in Computer Speech and Language

Pagination 27

Volume 36

Pages 233-259

Date 2016

Publisher London, Elsevier

Keywords

articulatory features; Automatic Speech Recognition; Grapheme subword units; Kullback–Leibler divergence based hidden Markov model; phoneme subword units; probabilistic lexical modeling

DOI https://doi.org/10.1016/j.csl.2015.04.003

Other identifier(s) View record in Web of Science

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2015-07-19

Actions

Preview

Select file: