Abstract

Current leading 6D object pose estimation methods rely heavily on annotated real data, which is costly to acquire. To overcome this, many works have proposed to use computer-generated synthetic data instead. However, bridging the gap between synthetic and real data remains a significant challenge. Image representations that capture different levels of realism and semantics typically transfer differently between the synthetic and real domains. Inspired by this observation, we introduce SD-Pose, an approach that explicitly decomposes the input image into multi-level semantic representations and then combines the merits of each representation to bridge the domain gap. Our comprehensive analyses and experiments show that this semantic decomposition strategy fully exploits the different domain similarities of the individual representations, allowing us to outperform the state of the art on modern 6D object pose datasets without accessing any real data during training.
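To make the core idea concrete, the sketch below is a minimal, hypothetical illustration (not the authors' implementation) of decomposing an image into low-, mid-, and high-level representations and fusing them into a single descriptor; all module names, layer widths, and the fused dimension are assumptions chosen for illustration only.

```python
# Illustrative sketch only, not SD-Pose itself: a hypothetical module that
# extracts low-, mid-, and high-level feature maps from an input image and
# fuses them, mirroring the idea of combining multi-level semantic
# representations that transfer differently from synthetic to real data.
import torch
import torch.nn as nn

class MultiLevelDecomposer(nn.Module):
    def __init__(self, in_channels=3, width=32):
        super().__init__()
        # Low level: edges/colors, which tend to transfer well across domains.
        self.low = nn.Sequential(nn.Conv2d(in_channels, width, 3, padding=1), nn.ReLU())
        # Mid level: part- and shape-like cues.
        self.mid = nn.Sequential(nn.Conv2d(width, width * 2, 3, stride=2, padding=1), nn.ReLU())
        # High level: more semantic, typically more domain-sensitive features.
        self.high = nn.Sequential(nn.Conv2d(width * 2, width * 4, 3, stride=2, padding=1), nn.ReLU())
        # Fusion head: combine the per-level descriptors before any pose head.
        self.fuse = nn.Linear(width + width * 2 + width * 4, 256)

    def forward(self, x):
        f_low = self.low(x)
        f_mid = self.mid(f_low)
        f_high = self.high(f_mid)
        # Global-average-pool each level and concatenate into one descriptor.
        pooled = [f.mean(dim=(2, 3)) for f in (f_low, f_mid, f_high)]
        return self.fuse(torch.cat(pooled, dim=1))

# Example: one 128x128 RGB image -> fused 256-D representation.
feat = MultiLevelDecomposer()(torch.randn(1, 3, 128, 128))
print(feat.shape)  # torch.Size([1, 256])
```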
