Encoder-Driven Inpainting Strategy in Multiview Video Compression

Gao, Yu; Cheung, Gene; Maugey, Thomas; Frossard, Pascal; Liang, Jie

doi:10.1109/Tip.2015.2498400

Gao, Yu; Cheung, Gene; Maugey, Thomas; Frossard, Pascal; Liang, Jie

2016

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In free viewpoint video systems, where a user has the freedom to select a virtual view from which an observation image of the 3D scene is rendered, the scene is commonly represented by texture and depth images from multiple nearby viewpoints. In such representation, there exists data redundancy across multiple dimensions: a single visible 3D voxel may be represented by pixels in multiple viewpoint images (inter-view redundancy), a pixel patch may recur in a distant spatial region of the same image due to self-similarity (inter-patch redundancy), and pixels in a local spatial region tend to be similar (inter-pixel redundancy). It isimportant to exploit these redundancies for effective multiview video compression. Existing schemes attempt to eliminate them via the traditional video coding paradigm of hybrid signal prediction/residual coding; typically, the encoder codes explicit information to guide the decoder to the location of the most similar block along with the signal differential. In this paper, we argue that, given the inherent redundancy in the representation, the decoder can often independently recover missing data via inpainting without explicit directions from encoder, resulting in lower coding overhead. Specifically, after pixels in a reference view are projected to a target view via depth image-based rendering (DIBR) at the decoder, the remaining holes in the target view are filled via an inpainting process in a block-by-block manner. First, blocks are ordered in terms of difficulty-to-inpaint by the decoder. Then, explicit instructions are only sent for the reconstruction of the most difficult blocks. In particular, the missing pixels are explicitly coded via a graph Fourier transform (GFT) or a sparsification procedure using DCT, which leads to low coding cost. For the blocks that are easy to inpaint, the decoder independently completes missing pixels via template-based inpainting. We implemented our encoder-driven inpainting strategy as an extension of High Efficiency Video Coding (HEVC). Experimental results show that our coding strategy can outperform comparable implementation of HEVC by up to 0.8dB in reconstructed image quality

Details

Title Encoder-Driven Inpainting Strategy in Multiview Video Compression

Author(s) Gao, Yu ; Cheung, Gene ; Maugey, Thomas ; Frossard, Pascal ; Liang, Jie

Published in IEEE Transactions on Image Processing

Pagination 16

Volume 25

Issue 1

Pages 134-149

Date 2016

Publisher Piscataway, Institute of Electrical and Electronics Engineers

ISSN 1057-7149

Keywords

Video compression; predictive encoding; transform coding

DOI https://doi.org/10.1109/Tip.2015.2498400

Other identifier(s) View record in Web of Science

Laboratories LTS4

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LTS4 - Signal Processing Laboratory 4
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2014-08-07

Actions

Preview

Select file: