Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. VL4Pose: Active Learning Through Out-Of-Distribution Detection For Pose Estimation
 
conference paper

VL4Pose: Active Learning Through Out-Of-Distribution Detection For Pose Estimation

Shukla, Megh  
•
Roy, Roshan
•
Singh, Pankaj
Show more
November 21, 2022
Proceedings of the 33rd British Machine Vision Conference
33rd British Machine Vision Conference (BMVC 2022)

Advances in computing have enabled widespread access to pose estimation, creating new sources of data streams. Unlike mock set-ups for data collection, tapping into these data streams through on-device active learning allows us to directly sample from the real world to improve the spread of the training distribution. However, on-device computing power is limited, implying that any candidate active learning algorithm should have a low compute footprint while also being reliable. Although multiple algorithms cater to pose estimation, they either use extensive compute to power state-of-the-art results or are not competitive in low-resource settings. We address this limitation with VL4Pose (Visual Likelihood For Pose Estimation), a first principles approach for active learning through out-of-distribution detection. We begin with a simple premise: pose estimators often predict incoherent poses for out-of-distribution samples. Hence, can we identify a distribution of poses the model has been trained on, to identify incoherent poses the model is unsure of? Our solution involves modelling the pose through a simple parametric Bayesian network trained via maximum likelihood estimation. Therefore, poses incurring a low likelihood within our framework are out-of-distribution samples making them suitable candidates for annotation. We also observe two useful side-outcomes: VL4Pose in-principle yields better uncertainty estimates by unifying joint and pose level ambiguity, as well as the unintentional but welcome ability of VL4Pose to perform pose refinement in limited scenarios. We perform qualitative and quantitative experiments on three datasets: MPII, LSP and ICVL, spanning human and hand pose estimation. Finally, we note that VL4Pose is simple, computationally inexpensive and competitive, making it suitable for challenging tasks such as on-device active learning.

  • Files
  • Details
  • Metrics
Type
conference paper
Author(s)
Shukla, Megh  
Roy, Roshan
Singh, Pankaj
Ahmed, Shuaib
Alahi, Alexandre  
Date Issued

2022-11-21

Publisher

BMVA Press

Publisher place

London

Published in
Proceedings of the 33rd British Machine Vision Conference
Total of pages

10

Subjects

active learning

•

human pose

•

out of distribution

•

pose refinement

•

hand pose

•

keypoint estimation

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
VITA  
Event nameEvent placeEvent date
33rd British Machine Vision Conference (BMVC 2022)

London, UK

November 21-24, 2022

Available on Infoscience
October 31, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/191711
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés