Landmarking for Navigational Streaming of Stored High-Dimensional Media

Huang, Jiwu

doi:10.1109/TCSVT.2022.3150780

Yuan, Yuan

•

Cheung, Gene

•

Frossard, Pascal

August 1, 2022

Ieee Transactions On Circuits And Systems For Video Technology

Landmarking for Navigational Streaming of Stored High-Dimensional Media

research article

Modern media data such as 360 degrees videos and light field (LF) images are typically captured in much higher dimensions than the observers' visual displays. To efficiently browse high-dimensional media, a navigational streaming model is considered: a client navigates the media space by dictating a navigation path to a server, who in response transmits the corresponding pre-encoded media data units (MDU) to the client one-by-one in sequence. Assuming that the MDU quality is pre-chosen and fixed, the problem resides in selecting and storing redundant representations of MDUs at the server in order to best trade off storage and transmission costs, while enabling adequate user's random access. We address this problem with a landmark-based MDU optimization framework. The media space is divided into neighborhoods, each containing one landmark (a chosen MDU). MDUs in a neighborhood use the associated landmark as a predictor for inter-coding. Thus, for any MDU transition within the same neighborhood, only one inter-coded MDU transmission is required when the landmark resides in the decoder buffer. It results in lower transmission cost and enables navigational random access. To optimize an MDU structure, we employ tree-structured vector quantizer (TSVQ) to first optimize landmark locations, then iteratively add P-MDUs as refinements using a fast branch-and-bound technique. Taking interactive LF images and viewport adaptive 360 degrees images as illustrative applications, and I-, P- and previously proposed merge frames to intra- and inter-code MDUs, we show experimentally that landmarked MDU structures can noticeably reduce the expected transmission cost compared with MDU structures without landmarks.

Type

research article

DOI

10.1109/TCSVT.2022.3150780

Web of Science ID

WOS:000835828500059

Author(s)

Yuan, Yuan

•

Cheung, Gene

•

Frossard, Pascal

•

Zhao, H. Vicky

•

Huang, Jiwu

Date Issued

2022-08-01

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Published in

Ieee Transactions On Circuits And Systems For Video Technology

Volume

32

Issue

8

Start page

5663

End page

5679

Subjects

Engineering, Electric...

Engineering

navigation

media

costs

servers

streaming media

encoding

videos

navigational streamin...

media compression

distributed source co...

frame design

video

saliency

Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

LTS4

Available on Infoscience

August 15, 2022

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/190080