Comic Digitization through the Extraction of Semantic Content and Style Analysis

Lenadora, Damitha; Ranathunge, Rakhitha; Samarawickrama, Chamath; De Silva, Yumantha; Perera, Indika; Welivita, Anuradha

doi:10.1109/ICTer48817.2019.9023647

conference paper

January 1, 2019

2019 19th International Conference on Advances in ICT for Emerging Regions (ICTer - 2019)

19th International Conference on Advances in ICT for Emerging Regions (ICTer)

Comic book digitization would play a pivotal role in opening new avenues for how digital comics can be consumed. At present, the systems capable of this task are too limited in capability to achieve complete digitization. Digitization requires an understanding of the content within comic books, which can be built from sub-tasks such as identifying and extracting comic book content, extracting and analyzing text, deriving character-speech balloon associations, and analyzing reading styles. This paper first presents an analysis of several object detection models for detecting semantic elements. Under the constraint of limited computational power, this analysis showed YOLOv3 to be the best suited of the models evaluated. Particular focus is then given to the extraction and recognition of text using Optical Character Recognition, along with distance-based methods for deriving associable speech balloons as well as character and speech balloon associations under given constraints. The presented association method improved accuracy relative to the Euclidean distance-based method. Finally, an analysis of comic styles is presented, along with a learning model that determines the reading order of comics with an accuracy of 0.89.
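
As an illustration of the Euclidean distance-based baseline that the abstract compares against, the following is a minimal sketch and not the association method presented in the paper. It assumes that speech balloons and characters are available as axis-aligned bounding boxes in the form (x_min, y_min, x_max, y_max), for example from an object detector, and it assigns each balloon to the character whose box center is nearest. The function names `center` and `associate_balloons` and the sample coordinates are hypothetical.

```python
from math import hypot


def center(box):
    """Return the (x, y) center of a box given as (x_min, y_min, x_max, y_max)."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2.0, (y_min + y_max) / 2.0)


def associate_balloons(balloons, characters):
    """Map each balloon index to the index of the nearest character.

    Distance is the Euclidean distance between box centers. This is an
    assumed baseline for illustration only; a balloon maps to None when
    no characters are given.
    """
    associations = {}
    for b_idx, balloon in enumerate(balloons):
        bx, by = center(balloon)
        best_idx, best_dist = None, float("inf")
        for c_idx, character in enumerate(characters):
            cx, cy = center(character)
            dist = hypot(bx - cx, by - cy)
            if dist < best_dist:
                best_idx, best_dist = c_idx, dist
        associations[b_idx] = best_idx
    return associations


# Hypothetical panel: two balloons, each drawn above a different character.
balloons = [(10, 10, 60, 40), (200, 15, 260, 50)]
characters = [(20, 60, 70, 160), (190, 70, 250, 170)]
result = associate_balloons(balloons, characters)
```

A purely center-based rule like this ignores panel boundaries and balloon tails, which is one plausible reason a constrained association method can outperform it.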

Type

conference paper

DOI

10.1109/ICTer48817.2019.9023647

Web of Science ID

WOS:000556570300001

Author(s)

Lenadora, Damitha

Ranathunge, Rakhitha

Samarawickrama, Chamath

De Silva, Yumantha

Perera, Indika

Welivita, Anuradha

Date Issued

2019-01-01

Publisher

IEEE

Publisher place

New York

Published in

2019 19th International Conference on Advances in ICT for Emerging Regions (ICTer - 2019)

ISBN of the book

978-1-7281-5156-4

Series title/Series vol.

International Conference on Advances in ICT for Emerging Regions

Subjects

comics • digitization • content detection • text recognition • speech balloon to character association • comic styles

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

GR-PU

LIA

Event name	Event place	Event date
19th International Conference on Advances in ICT for Emerging Regions (ICTer)	Colombo, Sri Lanka	Sep 03-04, 2019

Available on Infoscience

August 21, 2020

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/171010