Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Neural Scene Decomposition for Multi-Person Motion Capture
 
conference paper

Neural Scene Decomposition for Multi-Person Motion Capture

Rhodin, Helge  
•
Constantin, Victor  
•
Katircioglu, Isinsu  
Show more
June 20, 2019
Conference On Computer Vision And Pattern Recognition (CVPR)
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Learning general image representations has proven key to the success of many computer vision tasks. For example, many approaches to image understanding problems rely on deep networks that were initially trained on ImageNet, mostly because the learned features are a valuable starting point to learn from limited labeled data. However, when it comes to 3D motion capture of multiple people, these features are only of limited use. In this paper, we therefore propose an approach to learning features that are useful for this purpose. To this end, we introduce a self-supervised approach to learning what we call a neural scene decomposition (NSD) that can be exploited for 3D pose estimation. NSD comprises three layers of abstraction to represent human subjects: spatial layout in terms of bounding-boxes and relative depth; a 2D shape representation in terms of an instance segmentation mask; and subject-specific appearance and 3D pose information. By exploiting self-supervision coming from multiview data, our NSD model can be trained end-to-end without any 2D or 3D supervision. In contrast to previous approaches, it works for multiple persons and full-frame images. Because it encodes 3D geometry, NSD can then be effectively leveraged to train a 3D pose estimation network from small amounts of annotated data. Our code and newly introduced boxing dataset is available at github.com and cvlab.epfl.ch.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/CVPR.2019.00789
Author(s)
Rhodin, Helge  
Constantin, Victor  
Katircioglu, Isinsu  
Salzmann, Mathieu  
Fua, Pascal  
Date Issued

2019-06-20

Publisher

IEEE

Published in
Conference On Computer Vision And Pattern Recognition (CVPR)
ISBN of the book

978-1-7281-3293-8

Series title/Series vol.

IEEE Conference on Computer Vision and Pattern Recognition

Start page

7695

End page

7705

Note

CVPR 2019

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CVLAB  
Event nameEvent placeEvent date
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Long Beach, CA

Jun 16-20, 2019

Available on Infoscience
May 25, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/156548
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés