Emulating Temporal Receptive Fields of Higher Level Auditory Neurons for ASR

Sivaram, Garimella S. V. S.; Hermansky, Hynek

doi:10.1007/978-3-540-87391-4_65

conference paper

Emulating Temporal Receptive Fields of Higher Level Auditory Neurons for ASR

Sivaram, Garimella S. V. S.

•

Hermansky, Hynek

2008

Text, Speech And Dialogue, Proceedings

11th International Conference on Text, Speech and Dialogue

This paper proposes modifications to the Multi-resolution RASTA (MRASTA) feature extraction technique for the automatic speech recognition (ASR). By emulating asymmetries of the temporal receptive field (TRF) profiles of higher level auditory neurons, we obtain more than 11.4% relative improvement in word error rate on OGI-Digits database. Experiments on TIMIT database confirm that proposed modifications are indeed useful.

Type

conference paper

DOI

10.1007/978-3-540-87391-4_65

Web of Science ID

WOS:000259634700065

Author(s)

Sivaram, Garimella S. V. S.

Hermansky, Hynek

Date Issued

2008

Publisher

Springer-Verlag New York, Ms Ingrid Cunningham, 175 Fifth Ave, New York, Ny 10010 Usa

Published in

Text, Speech And Dialogue, Proceedings

Series title/Series vol.

Lecture Notes In Artificial Intelligence; 5246

Start page

509

End page

516

Subjects

Feature extraction

•

auditory neurons and speech recognition

•

Cortex

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

LIDIAP

Event name	Event place	Event date
11th International Conference on Text, Speech and Dialogue	Brno, CZECH REPUBLIC	Sep 08-12, 2008

Available on Infoscience

November 30, 2010

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/60925