Exploiting Contextual Information for Improved Phoneme Recognition

Pinto, Joel Praveen; Yegnanarayana, B.; Hermansky, Hynek; Magimai.-Doss, Mathew

2007

Abstract

In this paper, we investigate the significance of contextual information in a phoneme recognition system using the hidden Markov model/artificial neural network (HMM/ANN) paradigm. Contextual information is probed at the feature level as well as at the output of the multilayer perceptron. At the feature level, we analyse and compare different methods to model sub-phonemic classes. To exploit the contextual information at the output of the multilayer perceptron, we propose the hierarchical estimation of phoneme posterior probabilities. The best phoneme recognition accuracy (excluding silence) of 73.4% on the TIMIT database is comparable to that of state-of-the-art systems, but the emphasis here is on the analysis of the contextual information.

Details

Title Exploiting Contextual Information for Improved Phoneme Recognition

Author(s) Pinto, Joel Praveen ; Yegnanarayana, B. ; Hermansky, Hynek ; Magimai.-Doss, Mathew

Date 2007

Publisher IDIAP

Additional link URL

Laboratories LIDIAP

The document appears in Scientific output and expertise > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Scientific output and expertise > Euler Center for Signal Processing
Work produced at EPFL
Technical reports
Published

Record creation date 2010-02-11
