Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding

This paper proposes an analysis technique for wide-band audio applications based on the predictability of the temporal evolution of Quadrature Mirror Filter (QMF) sub-band signals. The input audio signal is first decomposed into 64 sub-band signals using QMF decomposition. The temporal envelopes in critically sampled QMF sub-bands are approximated using frequency domain linear prediction applied over relatively long time segments (e.g. 1000 ms). Line Spectral Frequency parameters related to autoregressive models are computed and quantized in each frequency sub-band. The sub-band residuals are quantized in the frequency domain using a combination of split Vector Quantization (VQ) (for magnitudes) and uniform scalar quantization (for phases). In the decoder, the sub-band signal is reconstructed using the quantized residual and the corresponding quantized envelope. Finally, application of inverse QMF reconstructs the audio signal. Even with simple quantization techniques and without any sophisticated modules, the proposed audio coder provides encouraging results in objective quality tests. Also, the proposed coder is easily scalable across a wide range of bit-rates.

Presented at:
4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI)
IDIAP-RR 07-16

 Record created 2010-02-11, last modified 2018-01-28

External links:
Download fulltextURL
Download fulltextRelated documents
Download fulltextn/a
Rate this document:

Rate this document:
(Not yet reviewed)