Infoscience

doctoral thesis

Image-based Pose Estimation for Previously-Unseen Objects

Zhao, Chen  
2025

Spatial artificial intelligence (spatial AI) is a field dedicated to enabling machines to perceive, understand, and interact with the physical world in 3D, bridging the gap between digital systems and the physical world. This thesis focuses on 3D interaction in spatial AI, which underpins applications such as robotic manipulation, virtual reality, and augmented reality. Central to 3D interaction is object pose estimation, which determines an object's 3D translation and 3D orientation from visual input. In real applications, spatial AI systems are often deployed in dynamic, diverse, and unstructured environments, which demand algorithms that are both robust and capable of generalization. However, most existing object pose estimation methods operate at the instance level: they can only estimate poses for the same object instances seen during training. These methods become inapplicable when previously unseen objects appear at test time. This thesis therefore addresses image-based pose estimation for previously unseen objects, aiming to develop methods that generalize to new objects. A key insight is that generalizable object pose estimation inherently relies on a reference, which plays a crucial role in both identifying the object and defining a canonical coordinate system. In this thesis, we investigate pose estimation under three different reference formulations: dense-view reference images, sparse-view reference images, and a single reference image.

Name: EPFL_TH11262.pdf
Type: Main Document
Version: Not Applicable (or Unknown)
Access type: openaccess
License Condition: N/A
Size: 16.18 MB
Format: Adobe PDF
Checksum (MD5): fb846831e66a5ef18672c0822f54baf1

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved