Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes

Motlicek, Petr; Hermansky, Hynek; Ganapathy, Sriram; Garudadri, Harinath

doi:10.1007/978-3-540-74628-7_46

Motlicek, Petr; Hermansky, Hynek; Ganapathy, Sriram; Garudadri, Harinath

2007

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Unlike classical state-of-the-art coders that are based on short-term spectra, our approach uses relatively long temporal segments of audio signal in critical-band-sized sub-bands. We apply auto-regressive model to approximate Hilbert envelopes in frequency sub-bands. Residual signals (Hilbert carriers) are demodulated and thresholding functions are applied in spectral domain. The Hilbert envelopes and carriers are quantized and transmitted to the decoder. Our experiments focused on designing speech/audio coder to provide broadcast radio-like quality audio around 15-25kbps. Obtained objective quality measures, carried out on standard speech recordings, were compared to the state-of-the-art 3GPP-AMR speech coding system.

Details

Title Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes

Author(s) Motlicek, Petr ; Hermansky, Hynek ; Ganapathy, Sriram ; Garudadri, Harinath

Published in TSD 2007: Text, Speech and Dialogue

Pages 350–357

Conference Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD)

Date 2007

Note IDIAP-RR 06-30

DOI https://doi.org/10.1007/978-3-540-74628-7_46

Additional link URL; Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2010-02-11

Actions

Preview

Select file: