Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Label Critic: Design Data Before Models
 
conference paper

Label Critic: Design Data Before Models

Bassi, Pedro R.A.S.
•
Wu, Qilong
•
Li, Wenxuan
Show more
2025
2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI)
2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI)

As medical datasets rapidly expand, creating detailed annotations of different body structures becomes increasingly expensive and time-consuming. We consider that requesting radiologists to create detailed annotations is unnecessarily burdensome and that pre-existing AI models can largely automate this process. Following the spirit don't use a sledgehammer on a nut, we find that, rather than creating annotations from scratch, radiologists only have to review and edit errors if the Best-AI Labels have mistakes. To obtain the Best-AI Labels among multiple AI Labels, we developed an automatic tool, called Label Critic, that can assess label quality through tireless pairwise comparisons. Extensive experiments demonstrate that, when incorporated with our developed Image-Prompt pairs, pre-existing Large Vision-Language Models (LVLM), trained on natural images and texts, achieve 96.5% accuracy when choosing the best label in a pair-wise comparison, without extra fine-tuning. By transforming the manual annotation task (30-60 min/scan) into an automatic comparison task (15 sec/scan), we effectively reduce the manual efforts required from radiologists by an order of magnitude. When the Best-AI Labels are sufficiently accurate (81% depending on body structures), they will be directly adopted as the gold-standard annotations for the dataset, with lower-quality AI Labels automatically discarded. Label Critic can also check the label quality of a single AI Label with 71.8% accuracy when no alternatives are available for comparison, prompting radiologists to review and edit if the estimated quality is low (19% depending on body structures).

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ISBI60581.2025.10981154
Scopus ID

2-s2.0-105005831771

Author(s)
Bassi, Pedro R.A.S.

Johns Hopkins University

Wu, Qilong

Johns Hopkins University

Li, Wenxuan

Johns Hopkins University

Decherchi, Sergio

Istituto Italiano di Tecnologia

Cavalli, Andrea  

EPFL

Yuille, Alan

Johns Hopkins University

Zhou, Zongwei

Johns Hopkins University

Date Issued

2025

Publisher

IEEE Computer Society

Published in
2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI)
DOI of the book
https://doi.org/10.1109/ISBI60581.2025
ISBN of the book

979-8-3315-2052-6

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CECAM-GE  
Event nameEvent acronymEvent placeEvent date
2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI)

ISBI 2025

Houston, TX, USA

2025-04-14 - 2025-04-17

FunderFunding(s)Grant NumberGrant URL

Mc-Govern Foundation

Lustgarten Foundation

Italian Institute of Technology

16163,73010

Available on Infoscience
June 2, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/250929
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés