000197841 001__ 197841
000197841 005__ 20181119005203.0
000197841 020__ $$a978-0-8194-9931-8
000197841 0247_ $$2doi$$a10.1117/12.2047850
000197841 02470 $$2ISI$$a000334350600002
000197841 037__ $$aCONF
000197841 245__ $$aAudiovisual focus of attention and its application to Ultra High Definition video compression
000197841 269__ $$a2014
000197841 260__ $$aBellingham$$bSPIE$$c2014
000197841 300__ $$a12
000197841 336__ $$aConference Papers
000197841 490__ $$aProceedings of SPIE
000197841 520__ $$aUsing Focus of Attention (FoA) as a perceptual process in image and video compression belongs to well-known approaches to increase coding efficiency. It has been shown that foveated coding, when compression quality varies across the image according to region of interest, is more efficient than the alternative coding, when all region are compressed in a similar way. However, widespread use of such foveated compression has been prevented due to two main conflicting causes, namely, the complexity and the efficiency of algorithms for FoA detection. One way around these is to use as much information as possible from the scene. Since most video sequences have an associated audio, and moreover, in many cases there is a correlation between the audio and the visual content, audiovisual FoA can improve efficiency of the detection algorithm while remaining of low complexity. This paper discusses a simple yet efficient audiovisual FoA algorithm based on correlation of dynamics between audio and video signal components. Results of audiovisual FoA detection algorithm are subsequently taken into account for foveated coding and compression. This approach is implemented into H.265/HEVC encoder producing a bitstream which is fully compliant to any H.265/HEVC decoder. The influence of audiovisual FoA in the perceived quality of high and ultra-high definition audiovisual sequences is explored and the amount of gain in compression efficiency is analyzed.
000197841 6531_ $$aQuality assessment
000197841 6531_ $$aVideo coding
000197841 6531_ $$aFoveated coding
000197841 6531_ $$aAudiovisual source localization
000197841 6531_ $$aH.265/HEVC
000197841 6531_ $$aUltra High Definition
000197841 6531_ $$aAudiovisual focus of attention
000197841 700__ $$0244457$$aRerabek, Martin$$g206402
000197841 700__ $$0(EPFLAUTH)241141$$aNemoto, Hiromi$$g241141
000197841 700__ $$0242654$$aLee, Jong-Seok$$g184440
000197841 700__ $$0240223$$aEbrahimi, Touradj$$g105043
000197841 7112_ $$aSPIE electronic Imaging$$cSan Francisco, California, USA$$dFebruary 02, 2014
000197841 773__ $$j9014$$tHuman Vision and Electronic Imaging XIX
000197841 8564_ $$s5283710$$uhttps://infoscience.epfl.ch/record/197841/files/article-spie-ei-2014.pdf$$yPublisher's version$$zPublisher's version
000197841 909C0 $$0252077$$pMMSPL
000197841 909CO $$ooai:infoscience.tind.io:197841$$pSTI$$pconf$$qGLOBAL_SET
000197841 917Z8 $$x206402
000197841 917Z8 $$x206402
000197841 917Z8 $$x206402
000197841 937__ $$aEPFL-CONF-197841
000197841 973__ $$aEPFL$$rNON-REVIEWED$$sPUBLISHED
000197841 980__ $$aCONF