Hierarchical Tandem Features for ASR in Mandarin

Pinto, Joel Praveen; Magimai.-Doss, Mathew; Bourlard, Hervé

Pinto, Joel Praveen; Magimai.-Doss, Mathew; Bourlard, Hervé

2010

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We apply multilayer perceptron (MLP) based hierarchical Tandem features to large vocabulary continuous speech recognition in Mandarin. Hierarchical Tandem features are estimated using a cascade of two MLP classifiers which are trained independently. The first classifier is trained on perceptual linear predictive coefficients with a 90 ms temporal context. The second classifier is trained using the phonetic class conditional probabilities estimated by the first MLP, but with a relatively longer temporal context of about 150 ms. Experiments on the Mandarin DARPA GALE eval06 dataset show significant reduction (about 7.6% relative) in character error rates by using hierarchical Tandem features over conventional Tandem features.

Details

Title Hierarchical Tandem Features for ASR in Mandarin

Author(s) Pinto, Joel Praveen ; Magimai.-Doss, Mathew ; Bourlard, Hervé

Date 2010

Publisher Idiap

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2010-11-17

Actions

Preview

Select file: