Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Unlocking Comics: the Ai4va Dataset for Visual Understanding
 
conference paper

Unlocking Comics: the Ai4va Dataset for Visual Understanding

Gronquist, Peter  
•
Bhattacharjee, Deblina  
•
Aydemir, Bahar  
Show more
DelBue, A
•
Canton, C
Show more
May 12, 2025
Computer Vision – ECCV 2024 Workshops Milan, Italy, September 29–October 4, 2024, Proceedings
18th European Conference on Computer Vision

In the evolving landscape of deep learning, there is a pressing need for more comprehensive datasets capable of training models across multiple modalities. Concurrently, in digital humanities, there is a growing demand to leverage technology for diverse media adaptation and creation, yet limited by sparse datasets due to copyright and stylistic constraints. Addressing this gap, our paper presents a novel dataset comprising Franco-Belgian comics from the 1950s annotated for tasks including depth estimation, semantic segmentation, saliency detection, and character identification. It consists of two distinct and consistent styles and incorporates object concepts and labels taken from natural images. By including such diverse information across styles, this dataset not only holds promise for computational creativity but also offers avenues for the digitization of art and storytelling innovation. This dataset is a crucial component of the AI4VA Workshop Challenges https://sites.google.com/view/ai4vaeccv2024, where we specifically explore depth and saliency. Dataset details at https://github.com/IVRL/AI4VA (Work done when PG, DB, BA, and BO were at EPFL and supported in part by the Swiss National Science Foundation via the Sinergia grant CRSII5-180359.).

  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-031-92808-6_10
Web of Science ID

WOS:001544980800017

Author(s)
Gronquist, Peter  

École Polytechnique Fédérale de Lausanne

Bhattacharjee, Deblina  

École Polytechnique Fédérale de Lausanne

Aydemir, Bahar  

École Polytechnique Fédérale de Lausanne

Ozaydin, Baran  

École Polytechnique Fédérale de Lausanne

Zhang, Tong

École Polytechnique Fédérale de Lausanne

Salzmann, Mathieu  

École Polytechnique Fédérale de Lausanne

Susstrunk, Sabine  

École Polytechnique Fédérale de Lausanne

Editors
DelBue, A
•
Canton, C
•
Pont-Tuset, J
•
Tommasi, T
Date Issued

2025-05-12

Publisher

Springer Nature

Publisher place

Cham

Published in
Computer Vision – ECCV 2024 Workshops Milan, Italy, September 29–October 4, 2024, Proceedings
ISBN of the book

978-3-031-92807-9

978-3-031-92808-6

Series title/Series vol.

Lecture Notes in Computer Science; 15627

ISSN (of the series)

0302-9743

1611-3349

Start page

155

End page

172

Subjects

Semantic segmentation

•

Depth estimation

•

Saliency detection

•

Comics

•

Dataset

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CVLAB  
IVRL  
Event nameEvent acronymEvent placeEvent date
18th European Conference on Computer Vision

ECCV 2024

Milan, Italy

2024-09-29 - 2024-10-04

FunderFunding(s)Grant NumberGrant URL

Swiss National Science Foundation (SNSF)

CRSII5-180359

Available on Infoscience
September 19, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/254204
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés