Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Dvmnet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
 
conference paper

Dvmnet: Computing Relative Pose for Unseen Objects Beyond Hypotheses

Zhao, Chen  
•
Zhang, Tong
•
Dang, Zheng  
Show more
January 1, 2024
2024 IEEE/CVF Conference On Computer Vision And Pattern Recognition (CVPR)
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2024

Determining the relative pose of an object between two images is pivotal to the success of generalizable object pose estimation. Existing approaches typically approximate the continuous pose representation with a large number of discrete pose hypotheses, which incurs a computationally expensive process of scoring each hypothesis at test time. By contrast, we present a Deep Voxel Matching Network (DVMNet) that eliminates the need for pose hypotheses and computes the relative object pose in a single pass. To this end, we map the two input RGB images, reference and query, to their respective voxelized 3D representations. We then pass the resulting voxels through a pose estimation module, where the voxels are aligned and the pose is computed in an end-to-end fashion by solving a least-squares problem. To enhance robustness, we introduce a weighted closest voxel algorithm capable of mitigating the impact of noisy voxels. We conduct extensive experiments on the CO3D, LINEMOD, and Objaverse datasets, demonstrating that our method delivers more accurate relative pose estimates for novel objects at a lower computational cost compared to state-of-the-art methods. Our code is released at: https://github.com/sailor-z/DVMNet/.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/CVPR52733.2024.01936
Web of Science ID

WOS:001342515503079

Author(s)
Zhao, Chen  

École Polytechnique Fédérale de Lausanne

Zhang, Tong

École Polytechnique Fédérale de Lausanne

Dang, Zheng  

École Polytechnique Fédérale de Lausanne

Salzmann, Mathieu  

École Polytechnique Fédérale de Lausanne

Date Issued

2024-01-01

Publisher

IEEE

Publisher place

Los Alamitos

Published in
2024 IEEE/CVF Conference On Computer Vision And Pattern Recognition (CVPR)
DOI of the book
https://doi.org/10.1109/CVPR52733.2024
ISBN of the book

979-8-3503-5300-6

Series title/Series vol.

IEEE Conference on Computer Vision and Pattern Recognition

ISSN (of the series)

1063-6919

Start page

20485

End page

20495

Subjects

Science & Technology

•

Technology

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CVLAB  
Event nameEvent acronymEvent placeEvent date
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2024

IEEE/CVF 20240

Seattle, WA, USA

2024-06-17 - 2024-06-21

FunderFunding(s)Grant NumberGrant URL

Swiss National Science Foundation (SNSF)

CRSII5-180359

Swiss Innovation Agency (Innosuisse) via the BRIDGE Discovery grant

40B2-0 194729

Swiss National Science Foundation (SNSF)

40B2-0_194729

Available on Infoscience
May 26, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/250493
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés