Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR

Misra, Hemant; Bourlard, Hervé

doi:10.21437/Interspeech.2005-247

Misra, Hemant; Bourlard, Hervé

2005

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In a recent paper, we reported promising automatic speech recognition results obtained by appending spectral entropy features to PLP features. In the present paper, spectral entropy features are used along with PLP features in the framework of multi-stream combination. In a full-combination multi-stream hidden Markov model/artificial neural network (HMM/ANN) hybrid system, we train a separate multi-layered perceptron (MLP) for PLP features, for spectral entropy features and for both combined by concatenation. The output posteriors from these three MLPs are combined with weights inversely proportional to the entropies of their respective posterior distributions. We show that on the Numbers95 database, this approach yields a significant improvement under both clean and noisy conditions as compared to simply appending the features. Further, in the framework of a Tandem HMM/ANN system, we apply the same inverse entropy weighting to combine the outputs of the MLPs before the softmax non-linearity. Feeding the combined and decorrelated MLP outputs to the HMM gives a 9.2\% relative error reduction as compared to the baseline.

Details

Title Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR

Author(s) Misra, Hemant ; Bourlard, Hervé

Published in Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech)

Conference ISCA European Conference on Speech Communication and Technology (Eurospeech)

Date 2005

Publisher Lisbon, Portugal

Keywords

speech; misra; bourlard

Note IDIAP-RR 2005 10

DOI https://doi.org/10.21437/Interspeech.2005-247

Additional link URL; Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2006-03-10

Actions

Preview

Select file: