Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Efficient Video Coding based on Audio-Visual Focus of Attention
 
Loading...
Thumbnail Image
research article

Efficient Video Coding based on Audio-Visual Focus of Attention

Lee, Jong-Seok  
•
De Simone, Francesca  
•
Ebrahimi, Touradj  
2011
Journal of Visual Communication and Image Representation

This paper proposes an efficient video coding method using audio-visual focus of attention, which is based on the observation that sound-emitting regions in an audio-visual sequence draw viewers' attention. First, an audio-visual source localization algorithm is presented, where the sound source is identified by using the correlation between the sound signal and the visual motion information. The localization result is then used to encode different regions in the scene with different quality in such a way that regions close to the source are encoded with higher quality than those far from the source. This is implemented in the framework of H.264/AVC by assigning different quantization parameters for different regions. Through experiments with both standard and high definition sequences, it is demonstrated that the proposed method can yield considerable coding gains over the constant quantization mode of H.264/AVC without noticeable degradation of perceived quality.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

12_lee_jvc11.pdf

Access type

openaccess

Size

685.05 KB

Format

Adobe PDF

Checksum (MD5)

80db05ea93c78f1c1315ee8205b93652

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés