Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. OSLO-IC: On-the-Sphere Learned Omnidirectional Image Compression with Attention Modules and Spatial Context
 
conference paper

OSLO-IC: On-the-Sphere Learned Omnidirectional Image Compression with Attention Modules and Spatial Context

Wawerek-López, Paul
•
Bidgoli, Navid Mahmoudian
•
Frossard, Pascal  
Show more
April 6, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Developing effective 360-degree (spherical) image compression techniques is crucial for technologies like virtual reality and automated driving. This paper advances the state-of-the-art in on-the-sphere learning (OSLO) for omnidirectional image compression framework by proposing spherical attention modules, residual blocks, and a spatial autoregressive context model. These improvements achieve a 23.1% bit rate reduction in terms of WS-PSNR BD rate. Additionally, we introduce a spherical transposed convolution operator for upsampling, which reduces trainable parameters by a factor of four compared to the pixel shuffling used in the OSLO framework, while maintaining similar compression performance. Therefore, in total, our proposed method offers significant rate savings with a smaller architecture and can be applied to any spherical convolutional application.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/icassp49660.2025.10889131
Author(s)
Wawerek-López, Paul

Friedrich-Alexander-Universität Erlangen-Nürnberg,Multimedia Communications and Signal Processing,Germany

Bidgoli, Navid Mahmoudian

Trimble Inc.

Frossard, Pascal  

EPFL

Kaup, André

Friedrich-Alexander-Universität Erlangen-Nürnberg,Multimedia Communications and Signal Processing,Germany

Maugey, Thomas

Institut National de Recherche en Informatique et en Automatique (INRIA),Rennes,France

Date Issued

2025-04-06

Publisher

IEEE

Publisher place

Piscataway, NJ

Published in
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI of the book
https://doi.org/10.1109/ICASSP49660.2025
ISBN of the book

979-8-3503-6874-1

Start page

1

End page

5

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LTS4  
Event nameEvent acronymEvent placeEvent date
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

ICASSP 2025

Hyderabad, India

2025-04-06 - 2025-04-11

FunderFunding(s)Grant NumberGrant URL

Deutsche Forschungsgemeinschaft

Available on Infoscience
April 15, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/249263
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés