Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans

Eftekhar, Ainaz; Sax, Alexander; Malik, Jitendra; Zamir, Amir

doi:10.1109/ICCV48922.2021.01061

conference paper

Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans

Eftekhar, Ainaz

•

Sax, Alexander

•

Malik, Jitendra

more

January 1, 2021

2021 Ieee/Cvf International Conference On Computer Vision (Iccv 2021)

18th IEEE/CVF International Conference on Computer Vision (ICCV)

This paper introduces a pipeline to parametrically sample and render static multi-task vision datasets from comprehensive 3D scans from the real-world. In addition to enabling interesting lines of research, we show the tooling and generated data suffice to train robust vision models. Familiar architectures trained on a generated starter dataset reached state-of-the-art performance on multiple common vision tasks and benchmarks, despite having seen no benchmark or non-pipeline data. The depth estimation network outperforms MiDaS and the surface normal estimation network is the first to achieve human-level performance for in-the-wild surface normal estimation-at least according to one metric on the OASIS benchmark.

The Dockerized pipeline with CLI, the (mostly python) code, PyTorch dataloaders for the generated data, the generated starter dataset, download scripts and other utilities are all available through our project website.

Type

conference paper

DOI

10.1109/ICCV48922.2021.01061

Web of Science ID

WOS:000798743200075

Authors

Eftekhar, Ainaz

•

Sax, Alexander

•

Malik, Jitendra

•

Zamir, Amir

Publication date

2021-01-01

Publisher

IEEE

Published in

2021 Ieee/Cvf International Conference On Computer Vision (Iccv 2021)

ISBN of the book

978-1-6654-2812-5

Publisher place

New York

Start page

10766

End page

10776

Subjects

Computer Science, Art...

Computer Science, The...

Computer Science

Peer reviewed

REVIEWED

EPFL units

VILAB

Event name	Event place	Event date
18th IEEE/CVF International Conference on Computer Vision (ICCV)	ELECTR NETWORK	Oct 11-17, 2021

Available on Infoscience

July 4, 2022

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/188908