conference paper

Page Layout Analysis of Text-heavy Historical Documents: a Comparison of Textual and Visual Approaches

Najem-Meyer, Sven  
•
Romanello, Matteo  
Karsdorp, Folgert
•
Lassche, Alie
December 12, 2022
Proceedings of the Computational Humanities Research Conference 2022 Antwerp, Belgium, December 12-14, 2022
Third Conference on Computational Humanities Research (CHR 2022)

Page layout analysis is a fundamental step in document processing that segments a page into regions of interest. With highly complex layouts and mixed scripts, scholarly commentaries are text-heavy documents that remain challenging for state-of-the-art models. Their layout varies considerably across editions, and their most important regions are defined mainly by semantic characteristics rather than by graphical ones such as position or appearance. This setting calls for a comparison between textual, visual and hybrid approaches. We therefore assess the performance of two transformers (LayoutLMv3 and RoBERTa) and an object-detection network (YOLOv5). Although the results show a clear advantage for the latter, we also list several caveats to this finding. In addition to our experiments, we release a dataset of ca. 300 annotated pages sampled from 19th-century commentaries.

Type: conference paper
DOI: 10.48550/arXiv.2212.13924
Author(s): Najem-Meyer, Sven • Romanello, Matteo
Editors: Karsdorp, Folgert • Lassche, Alie • Nielbo, Kristoffer
Date Issued: 2022-12-12
Published in: Proceedings of the Computational Humanities Research Conference 2022 Antwerp, Belgium, December 12-14, 2022
Series title/Series vol.: CEUR Workshop Proceedings; 3290
Start page: 36
End page: 54
Subjects: Artificial Intelligence • Computation and Language • Computer Vision and Pattern Recognition • Layout Analysis • Historical Documents • Classical Studies
URL (Online Proceedings): https://ceur-ws.org/Vol-3290/
Editorial or Peer reviewed: REVIEWED
Written at: EPFL
EPFL units: DHLAB
Event name: Third Conference on Computational Humanities Research (CHR 2022)
Event place: Antwerp, Belgium
Event date: December 12-14, 2022
Relation (IsSupplementedBy): https://github.com/AjaxMultiCommentary/GT-commentaries-layout
Available on Infoscience: January 26, 2023
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/194331