dhSegment: A generic deep-learning approach for document segmentation

Oliveira, Sofia Ares; Seguin, Benoit; Kaplan, Frederic

doi:10.1109/ICFHR-2018.2018.00011

Oliveira, Sofia Ares; Seguin, Benoit; Kaplan, Frederic

2018

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

In recent years there have been multiple successful attempts tackling document processing problems separately by designing task specific hand-tuned strategies. We argue that the diversity of historical document processing tasks prohibits to solve them one at a time and shows a need for designing generic approaches in order to handle the variability of historical series. In this paper, we address multiple tasks simultaneously such as page extraction, baseline extraction, layout analysis or multiple typologies of illustrations and photograph extraction. We propose an open-source implementation of a CNN-based pixel-wise predictor coupled with task dependent post-processing blocks. We show that a single CNN-architecture can be used across tasks with competitive results. Moreover most of the task-specific post-precessing steps can be decomposed in a small number of simple and standard reusable operations, adding to the flexibility of our approach.

Details

Title dhSegment: A generic deep-learning approach for document segmentation

Author(s) Oliveira, Sofia Ares ; Seguin, Benoit ; Kaplan, Frederic

Published in Proceedings 2018 16Th International Conference On Frontiers In Handwriting Recognition (Icfhr)

Series International Conference on Handwriting Recognition

Pages 7-12

Conference 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), Niagara Falls, NY, Aug 05-08, 2018

Date 2018-01-01

Publisher New York, IEEE

ISSN 2167-6445

ISBN 978-1-5386-5875-8

Keywords

document segmentation; historical document processing; document layout analysis; neural network; deep learning

DOI https://doi.org/10.1109/ICFHR-2018.2018.00011

Other identifier(s) View record in Web of Science

Laboratories DHLAB

Record Appears in Scientific production and competences > CDH - College of Humanities and social sciences > Digital Humanities Institute > DHLAB - Digital Humanities Laboratory
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2019-01-23