Learning-based Image Coding: Early Solutions Reviewing and Subjective Quality Evaluation

Ascenso, Joao; Akyazi, Pinar; Pereira, Fernando; Ebrahimi, Touradj

doi:10.1117/12.2555368

conference paper

January 1, 2021

Optics, Photonics and Digital Technologies for Imaging Applications VI

Conference on Optics, Photonics and Digital Technologies for Imaging Applications VI

Nowadays, image and video are the data types that consume most of the resources of modern communication channels, in both fixed and wireless networks. Thus, it is vital to compress visual data as much as possible, while maintaining some target quality level, to enable efficient storage and transmission. Deep learning (DL) image coding solutions, typically using an auto-encoder architecture, promise significant improvements in compression efficiency. These methods adopt a novel coding approach where the encoder-decoder architecture is mostly based on neural networks, notably with analysis and synthesis transforms learned from a large amount of training data and an appropriate loss function. Only a limited number of works target the subjective evaluation of the compression performance of learning-based image coding solutions. Since learning-based image codecs use complex and highly non-linear generative models, the artifacts present in the decoded images are very different from conventional artifacts such as the blockiness, blurring and ringing distortions typical of traditional DCT block-based and wavelet image coding. In this context, the main objective of this paper is to review, characterize and evaluate some of the most relevant learning-based image coding solutions in the literature. Regarding the subjective quality evaluation, the assessment tests were conducted during the 84th JPEG meeting in Brussels, Belgium, by a mix of expert and naive observers. These subjective tests evaluated the performance of five state-of-the-art learning-based image coding solutions against four conventional, standard image codecs (HEVC, WebP, JPEG 2000 and JPEG), applied to eight natural images at four different coding bitrates. The experimental results show that the subjective quality obtained with the selected learning-based image coding solutions is competitive with that of conventional codecs. Moreover, a thorough inspection of the visual results has revealed some of the typical artifacts encountered in learning-based image coding.

Type

conference paper

DOI

10.1117/12.2555368

Web of Science ID

WOS:000671891800021

Author(s)

Ascenso, Joao

Akyazi, Pinar

Pereira, Fernando

Ebrahimi, Touradj

Date Issued

2021-01-01

Publisher

SPIE-INT SOC OPTICAL ENGINEERING

Publisher place

Bellingham

Published in

Optics, Photonics and Digital Technologies for Imaging Applications VI

ISBN of the book

978-1-5106-3479-4

Series title/Series vol.

Proceedings of SPIE

Volume

11353

Start page

113530S

Subjects

Optics • Imaging Science & Photographic Technology • deep learning • image compression • performance assessment • subjective quality

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

GR-EB

Event name

Conference on Optics, Photonics and Digital Technologies for Imaging Applications VI

Event place

ELECTR NETWORK

Event date

Apr 06-10, 2020

Available on Infoscience

July 31, 2021

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/180291