Infoscience

research article

Modulation Frequency Features For Phoneme Recognition In Noisy Speech

Ganapathy, Sriram; Thomas, Samuel; Hermansky, Hynek
2008
Journal of the Acoustical Society of America

In this letter, a new feature extraction technique is proposed, based on the modulation spectrum derived from syllable-length segments of sub-band temporal envelopes. These sub-band envelopes are derived from auto-regressive modelling of Hilbert envelopes of the signal in critical bands and are processed by both a static (logarithmic) and a dynamic (adaptive loops) compression. The resulting features are then used for machine recognition of phonemes in telephone speech. Without degrading performance in clean conditions, the proposed features show significant improvements compared to other state-of-the-art speech analysis techniques. In addition to the overall phoneme recognition rates, the performance on broad phonetic classes is reported.
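
As a rough illustration of two ideas named in the abstract (a Hilbert envelope followed by static logarithmic compression and a modulation spectrum), the following is a minimal sketch and not the authors' implementation. It deliberately omits the critical-band decomposition, the auto-regressive modelling of the envelopes, and the adaptive-loop compression. The function names `hilbert_envelope` and `modulation_spectrum`, the 8 kHz sampling rate, the 1-second segment length, and the toy amplitude-modulated input are all assumptions chosen for the example.

```python
import numpy as np

def hilbert_envelope(x):
    """Magnitude of the analytic signal, computed via the FFT."""
    n = len(x)
    spectrum = np.fft.fft(x)
    # Weights that zero the negative frequencies and double the positive ones.
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(spectrum * h))

def modulation_spectrum(envelope, fs, floor=1e-8):
    """Magnitude spectrum of the log-compressed, mean-removed envelope."""
    # Static (logarithmic) compression; the floor avoids log(0).
    log_env = np.log(np.maximum(envelope, floor))
    log_env = log_env - log_env.mean()
    mags = np.abs(np.fft.rfft(log_env))
    freqs = np.fft.rfftfreq(len(log_env), d=1.0 / fs)
    return freqs, mags

# Toy input (assumption): a 1 kHz carrier amplitude-modulated at 4 Hz,
# sampled at 8 kHz for 1 second.
fs = 8000
t = np.arange(fs) / fs
x = (1.0 + 0.5 * np.cos(2 * np.pi * 4 * t)) * np.cos(2 * np.pi * 1000 * t)

env = hilbert_envelope(x)
freqs, mags = modulation_spectrum(env, fs)
peak_hz = freqs[np.argmax(mags)]  # dominant modulation frequency of the toy signal
```

For this toy signal the recovered envelope follows the 4 Hz modulator, so the strongest modulation component sits at the modulation rate. A real front end along the lines described in the abstract would apply this per critical band and on syllable-length segments of speech.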

Name: Ganapathy_JASA-EL_2008.pdf
Access type: openaccess
Size: 237.43 KB
Format: Adobe PDF
Checksum (MD5): 7d1ba5b49cd34b7085f1908779e9e205


Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved