Audiovisual focus of attention and its application to Ultra High Definition video compression

Rerabek, Martin; Nemoto, Hiromi; Lee, Jong-Seok; Ebrahimi, Touradj

doi:10.1117/12.2047850

Rerabek, Martin; Nemoto, Hiromi; Lee, Jong-Seok; Ebrahimi, Touradj

2014

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Using Focus of Attention (FoA) as a perceptual process in image and video compression belongs to well-known approaches to increase coding efficiency. It has been shown that foveated coding, when compression quality varies across the image according to region of interest, is more efficient than the alternative coding, when all region are compressed in a similar way. However, widespread use of such foveated compression has been prevented due to two main conflicting causes, namely, the complexity and the efficiency of algorithms for FoA detection. One way around these is to use as much information as possible from the scene. Since most video sequences have an associated audio, and moreover, in many cases there is a correlation between the audio and the visual content, audiovisual FoA can improve efficiency of the detection algorithm while remaining of low complexity. This paper discusses a simple yet efficient audiovisual FoA algorithm based on correlation of dynamics between audio and video signal components. Results of audiovisual FoA detection algorithm are subsequently taken into account for foveated coding and compression. This approach is implemented into H.265/HEVC encoder producing a bitstream which is fully compliant to any H.265/HEVC decoder. The influence of audiovisual FoA in the perceived quality of high and ultra-high definition audiovisual sequences is explored and the amount of gain in compression efficiency is analyzed.

Details

Title Audiovisual focus of attention and its application to Ultra High Definition video compression

Author(s) Rerabek, Martin ; Nemoto, Hiromi ; Lee, Jong-Seok ; Ebrahimi, Touradj

Published in Human Vision and Electronic Imaging XIX

Pagination 12

Series Proceedings of SPIE

Volume 9014

Conference SPIE electronic Imaging, San Francisco, California, USA, February 02, 2014

Date 2014

Publisher Bellingham, SPIE

ISBN 978-0-8194-9931-8

Keywords

Quality assessment; Video coding; Foveated coding; Audiovisual source localization; H.265/HEVC; Ultra High Definition; Audiovisual focus of attention

DOI https://doi.org/10.1117/12.2047850

Other identifier(s) View record in Web of Science

Laboratories MMSPL

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > MMSPL - Multimedia Signal Processing Laboratory
Conference Papers
Work produced at EPFL
Published

Record creation date 2014-03-24

Files

Abstract

Details

PDF