Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Beyond Pixels: Semi-supervised Semantic Segmentation with a Multi-scale Patch-Based Multi-label Classifier
 
conference paper

Beyond Pixels: Semi-supervised Semantic Segmentation with a Multi-scale Patch-Based Multi-label Classifier

Howlader, Prantik
•
Das, Srijan
•
Le, Hieu  
Show more
Leonardis, Aleš
•
Ricci, Elisa
Show more
2025
Computer Vision – ECCV 2024 - 18th European Conference, Proceedings
18th European Conference on Computer Vision

Incorporating pixel contextual information is critical for accurate segmentation. In this paper, we show that an effective way to incorporate contextual information is through a patch-based classifier. This patch classifier is trained to identify classes present within an image region, which facilitates the elimination of distractors and enhances the classification of small object segments. Specifically, we introduce Multi-scale Patch-based Multi-label Classifier (MPMC), a novel plug-in module designed for existing semi-supervised segmentation (SSS) frameworks. MPMC offers patch-level supervision, enabling the discrimination of pixel regions of different classes within a patch. Furthermore, MPMC learns an adaptive pseudo-label weight, using patch-level classification to alleviate the impact of the teacher’s noisy pseudo-label supervision on the student. This lightweight module can be integrated into any SSS framework, significantly enhancing their performance. We demonstrate the efficacy of our proposed MPMC by integrating it into four SSS methodologies and improving them across two natural image and one medical segmentation dataset, notably improving the segmentation results of the baselines across all the three datasets. Code is available at: https://github.com/cvlab-stonybrook/Beyond-Pixels.

  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-031-73226-3_20
Scopus ID

2-s2.0-85209927510

Author(s)
Howlader, Prantik

Stony Brook University

Das, Srijan

The University of North Carolina at Charlotte

Le, Hieu  

École Polytechnique Fédérale de Lausanne

Samaras, Dimitris

Stony Brook University

Editors
Leonardis, Aleš
•
Ricci, Elisa
•
Roth, Stefan
•
Russakovsky, Olga
•
Sattler, Torsten
•
Varol, Gül
Date Issued

2025

Publisher

Springer Science and Business Media Deutschland GmbH

Published in
Computer Vision – ECCV 2024 - 18th European Conference, Proceedings
Series title/Series vol.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 15133 LNCS

ISSN (of the series)

1611-3349

0302-9743

Start page

342

End page

360

Subjects

Patches

•

Segmentation

•

Semi-supervised

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CVLAB  
Event nameEvent acronymEvent placeEvent date
18th European Conference on Computer Vision

Milan, Italy

2024-09-29 - 2024-10-04

FunderFunding(s)Grant NumberGrant URL

National Science Foundation

IIS-2123920,IIS-2212046,IIS-2245652

Available on Infoscience
January 26, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/244629
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés