Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Neural Distributed Image Compression with Cross-Attention Feature Alignment
 
conference paper

Neural Distributed Image Compression with Cross-Attention Feature Alignment

Mital, Nitish
•
Ozyilkan, Ezgi
•
Garjani, Ali  
Show more
January 1, 2023
2023 Ieee/Cvf Winter Conference On Applications Of Computer Vision (Wacv)
23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

We consider the problem of compressing an information source when a correlated one is available as side information only at the decoder side, which is a special case of the distributed source coding problem in information theory. In particular, we consider a pair of stereo images, which have overlapping fields of view, and are captured by a synchronized and calibrated pair of cameras as correlated image sources. In previously proposed methods, the encoder transforms the input image to a latent representation using a deep neural network, and compresses the quantized latent representation losslessly using entropy coding. The decoder decodes the entropy-coded quantized latent representation, and reconstructs the input image using this representation and the available side information. In the proposed method, the decoder employs a cross-attention module to align the feature maps obtained from the received latent representation of the input image and a latent representation of the side information. We argue that aligning the correlated patches in the feature maps allows better utilization of the side information. We empirically demonstrate the competitiveness of the proposed algorithm on KITTI and Cityscape datasets of stereo image pairs. Our experimental results show that the proposed architecture is able to exploit the decoder-only side information in a more efficient manner compared to previous works.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/WACV56688.2023.00253
Web of Science ID

WOS:000971500202060

Author(s)
Mital, Nitish
Ozyilkan, Ezgi
Garjani, Ali  
Gunduz, Deniz
Date Issued

2023-01-01

Publisher

IEEE COMPUTER SOC

Publisher place

Los Alamitos

Published in
2023 Ieee/Cvf Winter Conference On Applications Of Computer Vision (Wacv)
ISBN of the book

978-1-6654-9346-8

Series title/Series vol.

IEEE Winter Conference on Applications of Computer Vision

Start page

2497

End page

2506

Subjects

Computer Science, Artificial Intelligence

•

Engineering, Electrical & Electronic

•

Imaging Science & Photographic Technology

•

Computer Science

•

Engineering

•

Imaging Science & Photographic Technology

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
VILAB  
Event nameEvent placeEvent date
23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Waikoloa, HI

Jan 03-07, 2023

Available on Infoscience
July 31, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/199505
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés