- English
- français
Video coding based on audio-visual attention
This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans’ perception of multimedia experiences. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to the images in order to reduce redundant high-frequency information and achieve coding efficiency. We demonstrate the effectiveness of the proposed method for H.264/AVC coding along with the results of a subjective test.
Keywords: video coding ; audio-visual attention ; cross-modal interaction ; source localization ; H.264 ; perceived audio-visual quality
Reference
- MMSPL-CONF-2009-003
- View record in Web of Science
Record created on 2009-03-12, modified on 2012-03-20