Infoscience

report

Introducing Temporal Asymmetries in Feature Extraction for Automatic Speech Recognition

Sivaram, G. S. V. S.; Hermansky, Hynek
2008

We propose a new auditory-inspired feature extraction technique for automatic speech recognition (ASR). Features are extracted by filtering the temporal trajectory of spectral energy in each critical band of speech with a bank of finite impulse response (FIR) filters. The impulse responses of these filters are derived from a modified Gabor envelope in order to emulate the asymmetries of the temporal receptive field (TRF) profiles observed in higher-level auditory neurons. Relative to the MRASTA technique, we obtain an 11.4% relative improvement in word error rate on the OGI-Digits database and a 3.2% relative improvement in phoneme error rate on the TIMIT database.
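The filtering step described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not give the form of the modified Gabor envelope, so the sketch substitutes a hypothetical asymmetric envelope (a Gaussian with different widths on either side of its peak), differentiates it to obtain a zero-mean impulse response, and applies the resulting FIR bank to each critical-band energy trajectory. The function names, filter length, and width values are all assumptions chosen for illustration.

```python
import numpy as np

def asymmetric_envelope(length, peak, sigma_left, sigma_right):
    """Hypothetical stand-in for the modified Gabor envelope:
    a Gaussian with a different width on each side of its peak."""
    n = np.arange(length)
    sigma = np.where(n < peak, sigma_left, sigma_right)
    return np.exp(-0.5 * ((n - peak) / sigma) ** 2)

def make_filter_bank(length=101, peak=50, widths=((4, 8), (8, 16), (16, 32))):
    """Build FIR impulse responses from asymmetric envelopes (assumed design)."""
    bank = []
    for sigma_left, sigma_right in widths:
        env = asymmetric_envelope(length, peak, sigma_left, sigma_right)
        h = np.gradient(env)      # first derivative of the envelope
        h -= h.mean()             # remove any residual DC gain
        h /= np.linalg.norm(h)    # unit-energy normalisation
        bank.append(h)
    return np.stack(bank)         # shape: (num_filters, length)

def filter_trajectories(band_energies, bank):
    """Filter each critical-band temporal trajectory with every FIR filter.

    band_energies: array of shape (num_bands, num_frames)
    returns: array of shape (num_bands, num_filters, num_frames)
    """
    num_bands, num_frames = band_energies.shape
    if num_frames < bank.shape[1]:
        raise ValueError("need at least as many frames as filter taps")
    out = np.empty((num_bands, bank.shape[0], num_frames))
    for b in range(num_bands):
        for k, h in enumerate(bank):
            out[b, k] = np.convolve(band_energies[b], h, mode="same")
    return out

# Usage with synthetic data: 15 bands, 200 frames of random "log energies".
rng = np.random.default_rng(0)
energies = rng.standard_normal((15, 200))
bank = make_filter_bank()
features = filter_trajectories(energies, bank)
```

Each frame then carries one filtered value per band and per filter, which could be stacked into a feature vector. Using different left and right widths is only one simple way to introduce temporal asymmetry; the report's own envelope may differ.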

Name: sgarimel-idiap-rr-08-25.pdf
Access type: openaccess
Size: 138.93 KB
Format: Adobe PDF
Checksum (MD5): 01c8ce1efe67abf9d91acf663f57e9f9


Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved