Structured Prediction of 3D Human Pose with Deep Neural Networks

Tekin, Bugra; Katircioglu, Isinsu; Salzmann, Mathieu; Lepetit, Vincent; Fua, Pascal

doi:10.5244/C.30.130

conference paper

2016

Proceedings of the British Machine Vision Conference (BMVC)

Most recent approaches to monocular 3D pose estimation rely on Deep Learning. They either train a Convolutional Neural Network to directly regress from image to 3D pose, which ignores the dependencies between human joints, or model these dependencies via a max-margin structured learning framework, which involves a high computational cost at inference time. In this paper, we introduce a Deep Learning regression architecture for structured prediction of 3D human pose from monocular images that relies on an overcomplete autoencoder to learn a high-dimensional latent pose representation and account for joint dependencies. We demonstrate that our approach outperforms state-of-the-art ones both in terms of structure preservation and prediction accuracy.
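As a rough illustration of the overcomplete autoencoder mentioned in the abstract, the sketch below encodes a 3D pose vector into a latent code of higher dimension than the pose itself and decodes it back. It is not the authors' implementation: the 17-joint skeleton, the latent size, the single-layer encoder and decoder, the ReLU nonlinearity, and all names (`OvercompleteAutoencoder`, `reconstruction_error`) are assumptions made for illustration, and the weights are random rather than trained. One plausible reading of the abstract is that an image regressor (not shown) would predict the latent code, which the decoder then maps to a pose.

```python
import random

NUM_JOINTS = 17               # assumption: a 17-joint skeleton
POSE_DIM = NUM_JOINTS * 3     # x, y, z coordinates per joint
LATENT_DIM = 4 * POSE_DIM     # assumption: any size > POSE_DIM is "overcomplete"


def init_matrix(rows, cols, rng, scale=0.1):
    """Return a rows x cols matrix of small Gaussian random weights."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


def affine(weights, bias, x):
    """Compute weights @ x + bias for a list-of-lists matrix."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(x):
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in x]


class OvercompleteAutoencoder:
    """Hypothetical one-layer encoder and decoder with untrained weights."""

    def __init__(self, pose_dim=POSE_DIM, latent_dim=LATENT_DIM, seed=0):
        rng = random.Random(seed)
        self.w_enc = init_matrix(latent_dim, pose_dim, rng)
        self.b_enc = [0.0] * latent_dim
        self.w_dec = init_matrix(pose_dim, latent_dim, rng)
        self.b_dec = [0.0] * pose_dim

    def encode(self, pose):
        """Map a pose vector to a higher-dimensional latent code."""
        return relu(affine(self.w_enc, self.b_enc, pose))

    def decode(self, latent):
        """Map a latent code back to a pose vector."""
        return affine(self.w_dec, self.b_dec, latent)


def reconstruction_error(model, pose):
    """Mean squared error between a pose and its reconstruction."""
    recon = model.decode(model.encode(pose))
    return sum((a - b) ** 2 for a, b in zip(pose, recon)) / len(pose)


if __name__ == "__main__":
    model = OvercompleteAutoencoder()
    pose = [0.1] * POSE_DIM                      # placeholder pose, not real data
    print(len(model.encode(pose)), "latent dimensions for", POSE_DIM, "pose values")
    print("untrained reconstruction MSE:", reconstruction_error(model, pose))
```

In practice the reconstruction error would be minimized over a set of training poses, typically with some regularization so that the overcomplete code does not simply copy its input; the sketch omits training and only shows the shapes involved.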

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/128411

Name: tekin_bmvc16.pdf

Access type: openaccess

Size: 799.77 KB

Format: Adobe PDF

Checksum (MD5): df6a93d51a97373c05f87fc86dc87f11