Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Rethinking Pose Estimation in Crowds: Overcoming the Detection Information Bottleneck and Ambiguity
 
conference paper

Rethinking Pose Estimation in Crowds: Overcoming the Detection Information Bottleneck and Ambiguity

Zhou, Mu  
•
Stoffl, Lucas  
•
Mathis, Mackenzie  
Show more
2023
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
IEEE/CVF International Conference on Computer Vision (ICCV)

Frequent interactions between individuals are a fundamental challenge for pose estimation algorithms. Current pipelines either use an object detector together with a pose estimator (top-down approach), or localize all body parts first and then link them to predict the pose of individuals (bottom-up). Yet, when individuals closely interact, top-down methods are ill-defined due to overlapping individuals, and bottom-up methods often falsely infer connections to distant bodyparts. Thus, we propose a novel pipeline called bottom-up conditioned top-down pose estimation (BUCTD) that combines the strengths of bottom-up and top-down methods. Specifically, we propose to use a bottom-up model as the detector, which in addition to an estimated bounding box provides a pose proposal that is fed as condition to an attention-based top-down model. We demonstrate the performance and efficiency of our approach on animal and human pose estimation benchmarks. On CrowdPose and OCHuman, we outperform previous state-of-the-art models by a significant margin. We achieve 78.5 AP on CrowdPose and 48.5 AP on OCHuman, an improvement of 8.6% and 7.8% over the prior art, respectively. Furthermore, we show that our method strongly improves the performance on multi-animal benchmarks involving fish and monkeys. The code is available at https://github.com/amathislab/BUCTD.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICCV51070.2023.01350
Web of Science ID

WOS:001169499007012

Author(s)
Zhou, Mu  
Stoffl, Lucas  
Mathis, Mackenzie  
Mathis, Alexander  
Date Issued

2023

Published in
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Start page

14689

End page

14699

Editorial or Peer reviewed

NON-REVIEWED

Written at

EPFL

Event nameEvent placeEvent date
IEEE/CVF International Conference on Computer Vision (ICCV)

Paris

October 2-6, 2023

RelationURL/DOI

IsSupplementedBy

https://infoscience.epfl.ch/record/306197
Available on Infoscience
November 1, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/201989
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés