Infoscience

doctoral thesis

Improving Human Pose & Shape Estimation with Explicit and Implicit Priors

Davydov, Andrey  
2025

Computer vision has made remarkable strides in recent decades, becoming a cornerstone of modern technology with applications ranging from autonomous driving to medical imaging. Despite these advances, several core challenges remain, especially in tasks that involve understanding and modeling complex, real-world scenes. One particularly difficult domain is human-related computer vision, where the goal is to accurately estimate human poses, shapes, and movements from visual data. The difficulty stems from the variability of human appearance, the need for fine-grained detail, the limited ability of existing models to handle occlusions, body symmetries, and natural human dynamics, and their overreliance on large annotated datasets.

The present thesis addresses these challenges by proposing several novel solutions to key issues in human pose and shape estimation. First, it tackles the problem of inconsistencies in skeleton-based models, where left/right symmetries are often poorly maintained. A method is proposed to enforce symmetry constraints, improving the anatomical plausibility of keypoint-based skeleton models. This contribution enhances the accuracy of skeleton estimates, making them more consistent and realistic.
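
As a rough illustration of what a left/right symmetry constraint on a keypoint skeleton can look like, the sketch below penalizes differences between the lengths of mirrored bones. It is not taken from the thesis: the joint names, the `LEFT_RIGHT_BONES` pairing, and the squared-difference penalty are all assumptions chosen for the example.

```python
import math

# Hypothetical pairing of mirrored bones; each bone is (parent_joint, child_joint).
LEFT_RIGHT_BONES = [
    (("l_shoulder", "l_elbow"), ("r_shoulder", "r_elbow")),
    (("l_elbow", "l_wrist"), ("r_elbow", "r_wrist")),
]


def bone_length(joints, bone):
    """Euclidean length of one bone, given a dict of joint name -> (x, y, z)."""
    parent, child = bone
    return math.dist(joints[parent], joints[child])


def symmetry_penalty(joints, bone_pairs=LEFT_RIGHT_BONES):
    """Mean squared difference between left and right bone lengths."""
    diffs = [
        (bone_length(joints, left) - bone_length(joints, right)) ** 2
        for left, right in bone_pairs
    ]
    return sum(diffs) / len(diffs)
```

A perfectly mirrored skeleton yields a penalty of zero, and the penalty grows as mirrored limbs diverge in length. In a learning setup, a term of this kind would typically be added to the training loss or enforced by construction of the model.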

Next, the thesis addresses the generation of implausible poses by developing a generative prior that restricts pose generation to realistic body shapes. As a result, human pose estimators produce plausible outputs even for complex body configurations, which makes them more robust across applications.

Another issue we address is uncontrolled mesh interpenetration, which is inherent to volume-based representations and leads to unrealistic body shapes in which parts of the body overlap. To resolve it, we introduce a differentiable flow-based solution. Our technique resolves self-intersections while preserving the underlying body shape, so that the estimated meshes are not only accurate but also physically plausible.

Finally, the thesis proposes a solution to the overreliance on large annotated datasets, which are often difficult and costly to collect. By leveraging motion cues such as optical flow, this work demonstrates how models can be trained more effectively in data-scarce environments. This data-efficient supervision approach reduces the dependency on annotated datasets while still maintaining high model performance, opening the door for applications in fields where data collection is limited.
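
One generic way to turn optical flow into a supervision signal is to require that predicted keypoints move between consecutive frames the way the flow field says the underlying pixels moved. The sketch below shows that idea only; it is not the formulation used in the thesis. The function name, the nearest-pixel flow lookup, and the nested-list flow layout are assumptions made for the example.

```python
def flow_consistency_loss(kps_t, kps_t1, flow):
    """Mean squared mismatch between keypoint motion and optical flow.

    kps_t, kps_t1: lists of (x, y) keypoints predicted in frames t and t+1.
    flow: nested list where flow[row][col] = (dx, dy) is the pixel
          displacement from frame t to frame t+1 (assumed layout).
    """
    height, width = len(flow), len(flow[0])
    total = 0.0
    for (x0, y0), (x1, y1) in zip(kps_t, kps_t1):
        # Nearest-pixel lookup of the flow at the frame-t keypoint, clamped to the image.
        col = min(max(int(round(x0)), 0), width - 1)
        row = min(max(int(round(y0)), 0), height - 1)
        fx, fy = flow[row][col]
        # Compare the predicted displacement with the displacement given by the flow.
        total += (x1 - x0 - fx) ** 2 + (y1 - y0 - fy) ** 2
    return total / len(kps_t)
```

Because the flow can be computed from unlabeled video, a term like this needs no pose annotations, which is the sense in which motion cues can reduce the dependency on annotated data. A differentiable implementation would replace the nearest-pixel lookup with bilinear sampling.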

While these advancements represent significant progress in human-related computer vision, certain limitations persist. To conclude, we discuss the limitations of the proposed methods, suggest potential avenues for improvement, and speculate on future directions for human-related computer vision research.

Type
doctoral thesis
DOI
10.5075/epfl-thesis-10577
Author(s)
Davydov, Andrey  

EPFL

Advisors
Fua, Pascal • Salzmann, Mathieu
Jury

Prof. Nicolas Henri Bernard Flammarion (president) ; Prof. Pascal Fua, Dr Mathieu Salzmann (thesis directors) ; Prof. Alexandre Alahi, Dr Marc Habermann, Prof. Siyu Tang (examiners)

Date Issued

2025

Publisher

EPFL

Publisher place

Lausanne

Public defense date

2025-02-21

Thesis number

10577

Total number of pages

121

Subjects

computer vision • deep learning • human pose estimation • human shape recovery • generative models • data efficiency • symmetry constraints • self-interpenetrations

EPFL units
CVLAB  
Faculty
IC  
School
IINFCOM  
Doctoral School
EDIC  
Available on Infoscience
February 12, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/246886
  • Contact
  • infoscience@epfl.ch

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved