
Abstract

The use of point clouds as an imaging modality has been growing rapidly, motivating research on compression methods that enable efficient transmission and storage for many applications. While compression standards relying on conventional techniques such as planar projection and octree-based representation have been standardized by MPEG, recent research has demonstrated the potential of neural networks to achieve better rate-distortion performance for point cloud geometry coding. Early attempts at learning-based point cloud coding mostly relied on autoencoder architectures using dense convolutional layers, but the majority of recent research has shifted towards sparse convolutions, which are applied only to occupied positions rather than the entire space. Since points are usually distributed on underlying surfaces rather than throughout volumes, such operations reduce the computational complexity required to compress and decompress point clouds. Moreover, recent solutions also achieve better compression efficiency, allocating fewer bits at similar levels of geometric distortion. However, it is not clear to what extent this gain in performance is due to the use of sparse convolutions, if at all, since the architecture of the model is often modified as well. In this paper, we evaluate the effect of replacing dense convolutions with sparse convolutions on the rate-distortion performance of the JPEG Pleno Point Cloud Verification Model. Results show that the use of sparse convolutions yields an average BD-rate reduction of approximately 9% for both the D1 and D2 PSNR metrics under similar training procedures, with an even larger reduction for point clouds with reduced point density.
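To make the dense/sparse distinction concrete, the sketch below contrasts a dense 3D convolution evaluated over an entire voxel grid with a sparse convolution evaluated only at occupied voxels. It is a minimal illustration, assuming PyTorch and the MinkowskiEngine library are available; the grid resolution, channel counts, and kernel size are arbitrary choices for illustration and do not reproduce the JPEG Pleno Verification Model architecture.

```python
# Minimal sketch: dense vs. sparse 3D convolution on voxelized geometry.
# Assumes PyTorch and MinkowskiEngine; layer sizes are illustrative only.
import torch
import torch.nn as nn
import MinkowskiEngine as ME

resolution = 64                       # voxel grid resolution
num_points = 2048                     # occupied voxels (points on a surface)

# Random occupied coordinates, prefixed with a batch index: (b, x, y, z).
coords = torch.randint(0, resolution, (num_points, 3), dtype=torch.int32)
coords = torch.unique(coords, dim=0)  # drop duplicate voxels
coords = torch.cat(
    [torch.zeros(coords.shape[0], 1, dtype=torch.int32), coords], dim=1)
feats = torch.ones(coords.shape[0], 1)  # single occupancy feature per voxel

# Dense path: the convolution visits every position of the 64^3 grid,
# even though only a small fraction of the voxels are occupied.
dense_grid = torch.zeros(1, 1, resolution, resolution, resolution)
dense_grid[0, 0,
           coords[:, 1].long(),
           coords[:, 2].long(),
           coords[:, 3].long()] = 1.0
dense_conv = nn.Conv3d(in_channels=1, out_channels=16, kernel_size=3, padding=1)
dense_out = dense_conv(dense_grid)    # shape (1, 16, 64, 64, 64)

# Sparse path: the convolution is evaluated only at occupied coordinates,
# so compute and memory scale with the number of points, not the volume.
sparse_in = ME.SparseTensor(features=feats, coordinates=coords)
sparse_conv = ME.MinkowskiConvolution(in_channels=1, out_channels=16,
                                      kernel_size=3, dimension=3)
sparse_out = sparse_conv(sparse_in)

print(dense_out.numel())              # 16 * 64^3 output values computed
print(sparse_out.F.shape)             # (number of occupied voxels, 16)
```

The output sizes printed at the end illustrate the point made in the abstract: for surface-like point distributions the sparse operator touches orders of magnitude fewer positions than the dense one at the same grid resolution.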
