Autoregressive Models of Amplitude Modulations in Audio Compression

Ganapathy, Sriram; Motlicek, Petr; Hermansky, Hynek

doi:10.1109/TASL.2009.2038813

Ganapathy, Sriram; Motlicek, Petr; Hermansky, Hynek

2010

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

We present a scalable medium bit-rate wide-band audio coding technique based on frequency domain linear prediction (FDLP). FDLP is an efficient method for representing the long-term amplitude modulations of speech/audio signals using autoregressive models. For the proposed audio codec, relatively long temporal segments (1000 ms) of the input audio signal are decomposed into a set of critically sampled sub-bands using a quadrature mirror filter (QMF) bank. The technique of FDLP is applied on each sub-band to model the sub-band temporal envelopes. The residual of the linear prediction, which represents the frequency modulations in the sub-band signal [1], are encoded and transmitted along with the envelope parameters. These steps are reversed at the decoder to reconstruct the signal. The proposed codec utilizes a simple signal independent non-adaptive compression mechanism for a wide class of speech and audio signals. The subjective and objective quality evaluations show that the reconstruction signal quality for the proposed FDLP codec compares well with the state-of-the-art audio codecs in the 32-64 kbps range.

Details

Title Autoregressive Models of Amplitude Modulations in Audio Compression

Author(s) Ganapathy, Sriram ; Motlicek, Petr ; Hermansky, Hynek

Published in IEEE Transactions on Audio, Speech, and Language Processing

Volume 18

Issue 6

Pages 1624-1631

Date 2010

DOI https://doi.org/10.1109/TASL.2009.2038813

Additional link Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2010-02-11