Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain
 
Loading...
Thumbnail Image
conference paper not in proceedings

Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain

Thomas, Samuel
•
Ganapathy, Sriram  
•
Hermansky, Hynek  
2008
EUSIPCO 2008

Frequency Domain Linear Prediction (FDLP) provides an efficient way to represent temporal envelopes of a signal using auto-regressive models. For the input speech signal, we use FDLP to estimate temporal trajectories of sub-band energy by applying linear prediction on the cosine transform of sub-band signals. The sub-band FDLP envelopes are used to extract spectral and temporal features for speech recognition. The spectral features are derived by integrating the temporal envelopes in short-term frames and the temporal features are formed by converting these envelopes into modulation frequency components. These features are then combined in the phoneme posterior level and used as the input features for a hybrid HMM-ANN based phoneme recognizer. The proposed spectro-temporal features provide a phoneme recognition accuracy of $69.1 %$ (an improvement of $4.8 %$ over the Perceptual Linear Prediction (PLP) base-line) for the TIMIT database.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

tsamuel-eusipco-2008.pdf

Access type

openaccess

Size

266.13 KB

Format

Adobe PDF

Checksum (MD5)

162280df4bf20c874e354ac9eec84478

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés