Infoscience

Conference paper

Video coding based on audio-visual attention

This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans’ perception of multimedia experiences. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to the images in order to reduce redundant high-frequency information and achieve coding efficiency. We demonstrate the effectiveness of the proposed method for H.264/AVC coding along with the results of a subjective test.

Keywords: video coding ; audio-visual attention ; cross-modal interaction ; source localization ; H.264 ; perceived audio-visual quality

Reference

Record created on 2009-03-12, modified on 2012-03-20