Exploiting Contextual Information for Improved Phoneme Recognition

Pinto, Joel Praveen; Yegnanarayana, B.; Hermansky, Hynek; Magimai.-Doss, Mathew

Pinto, Joel Praveen; Yegnanarayana, B.; Hermansky, Hynek; Magimai.-Doss, Mathew

2007

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In this paper, we investigate the significance of contextual information in a phoneme recognition system using the hidden Markov model - artificial neural network paradigm. Contextual information is probed at the feature level as well as at the output of the multilayerd perceptron. At the feature level, we analyse and compare different methods to model sub-phonemic classes. To exploit the contextual information at the output of the multilayered perceptron, we propose the hierarchical estimation of phoneme posterior probabilities. The best phoneme (excluding silence) recognition accuracy of 73.4\% on the TIMIT database is comparable to that of the state-of-the-art systems, but more emphasis is on analysis of the contextual information.

Details

Title Exploiting Contextual Information for Improved Phoneme Recognition

Author(s) Pinto, Joel Praveen ; Yegnanarayana, B. ; Hermansky, Hynek ; Magimai.-Doss, Mathew

Date 2007

Publisher IDIAP

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2010-02-11

Files

Abstract

Details

PDF