Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. HAECcity: Open-Vocabulary Scene Understanding of City-Scale Point Clouds with Superpoint Graph Clustering
 
conference paper

HAECcity: Open-Vocabulary Scene Understanding of City-Scale Point Clouds with Superpoint Graph Clustering

Rusnak, Alexander  
•
Kaplan, Frederic  
2025
2025 IEEE/CFV Computer Society Conference on Computer Vision and Pattern Recognition Workshops. CVPRW 2025
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops

Traditional 3D scene understanding techniques are generally predicated on hand-annotated label sets, but in recent years a new class of open-vocabulary 3D scene understanding techniques has emerged. Despite the success of this paradigm on small scenes, existing approaches cannot scale efficiently to city-scale 3D datasets. In this paper, we present Hiearchical vocab-Agnostic Expert Clustering (HAEC), after the latin word for 'these', a superpoint graph clustering based approach which utilizes a novel mixture of experts graph transformer for its backbone. We administer this highly scalable approach to the first application of open-vocabulary scene understanding on the SensatUrban city-scale dataset. We also demonstrate a synthetic labeling pipeline which is derived entirely from the raw point clouds with no hand-annotation. Our technique can help unlock complex operations on dense urban 3D scenes and open a new path forward in the processing of digital twins.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/CVPRW67362.2025.00521
Scopus ID

2-s2.0-105017843222

Author(s)
Rusnak, Alexander  

École Polytechnique Fédérale de Lausanne

Kaplan, Frederic  

École Polytechnique Fédérale de Lausanne

Date Issued

2025

Publisher

IEEE Computer Society

Published in
2025 IEEE/CFV Computer Society Conference on Computer Vision and Pattern Recognition Workshops. CVPRW 2025
DOI of the book
https://doi.org/10.1109/CVPRW67362.2025
ISBN of the book

9798331599942

Start page

5256

End page

5265

Subjects

deep learning

•

digital twin

•

open vocabulary

•

point cloud processing

•

scene understanding

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DHLAB  
Event nameEvent acronymEvent placeEvent date
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops

Nashville, TN, USA

2025-06-11 - 2025-06-12

Available on Infoscience
October 14, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/254934
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés