Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. EPFL thesis
  4. Aggregating Spatial and Photometric Context for Photometric Stereo
 
doctoral thesis

Aggregating Spatial and Photometric Context for Photometric Stereo

Honzátko, David  
2024

Photometric stereo, a computer vision technique for estimating the 3D shape of objects through images captured under varying illumination conditions, has been a topic of research for nearly four decades. In its general formulation, photometric stereo is an ill-posed problem and requires robust prior knowledge of material reflectance properties, light transport, and object shapes, all of which are quite difficult to obtain in many scenarios.

We focus on task of estimating the surface normals of an inspected object given a large, but apriori unknown, number of input images and the illumination directions under which these images were captured. This is also known as far-field dense calibrated photometric stereo, and it is the main topic of this thesis.

Like in many other computer vision fields, recent advances in photometric stereo have leveraged deep learning. Despite their success, these methods struggle with the large input data dimensionality, the disparity between the spatial domain and the domain of illumination directions, the apriori unknown number of observations provided for a scene, and the general unavailability of extensive real data collections to train them.

To tackle these issues, we formulate the problem as a four-dimensional regression and propose novel neural architectures that leverage both the spatial context of individual images and the photometric context captured in the intensity variations of individual pixels under different illumination directions. Our methods work with the concept of observation maps -- fixed-size two-dimensional planes, encoding pixel intensities together with the associated illumination directions for each pixel separately. This framework enabled the design of fully convolutional networks utilizing separable four-dimensional convolutions, which simultaneously process observation maps and image spatial dimensions, thus learning both reflectance and shape prior knowledge. With this approach, we achieve higher performance than the existing works.

Additionally, we introduce a fast rendering approach for on-the-fly sample generation during training, which allows for much larger diversity in shape and reflectance properties than existing static datasets offer. Coupled with an efficient training strategy, this approach enables training the four-dimensional neural architectures on standard consumer hardware within a reasonable timeframe. These innovations have culminated in state-of-the-art qualitative performance on all relevant benchmark datasets that feature real images, thus making a significant contribution to the field of photometric stereo.

  • Files
  • Details
  • Metrics
Type
doctoral thesis
DOI
10.5075/epfl-thesis-9806
Author(s)
Honzátko, David  
Advisors
Fua, Pascal  
•
Türetken, Engin  
Jury

Dr Martin Rajman (président) ; Prof. Pascal Fua, Dr Engin Türetken (directeurs) ; Prof. Sabine Süsstrunk, Prof. Yasuyuki Matsushita, Dr Mathieu Aubry (rapporteurs)

Date Issued

2024

Publisher

EPFL

Publisher place

Lausanne

Public defense year

2024-02-02

Thesis number

9806

Total of pages

114

Subjects

photometric stereo

•

synthetic data generation

•

four-dimensional convolutions

•

fully convolutional neural architectures

•

3D shape reconstruction

EPFL units
CVLAB  
Faculty
IC  
School
IINFCOM  
Doctoral School
EDIC  
Available on Infoscience
January 29, 2024
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/203212
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés