Multi-scale sequential network for semantic text segmentation and localization

Villamizar, Michael; Canévet, Olivier; Odobez, Jean-Marc

doi:10.1016/j.patrec.2019.11.001

Villamizar, Michael; Canévet, Olivier; Odobez, Jean-Marc

2020

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

We present a novel method for semantic text document analysis which in addition to localizing text it labels the text in user-defined semantic categories. More precisely, it consists of a fully-convolutional and sequential network that we apply to the particular case of slide analysis to detect title, bullets and standard text. Our contributions are twofold: (1) A multi-scale network consisting of a series of stages that sequentially refine the prediction of text and semantic labels (text, title, bullet); (2) A synthetic database of slide images with text and semantic annotation that is used to train the network with abundant data and wide variability in text appearance, slide layouts, and noise such as compression artifacts. We evaluate our method on a collection of real slide images collected from multiple conferences, and show that it is able to localize text with an accuracy of 95%, and to classify titles and bullets with accuracies of 94% and 85% respectively. In addition, we show that our method is competitive on scene and born-digital image datasets, such as ICDAR 2011, where it achieves an accuracy of 91.1%.

Details

Title Multi-scale sequential network for semantic text segmentation and localization

Author(s) Villamizar, Michael ; Canévet, Olivier ; Odobez, Jean-Marc

Published in Pattern Recognition Letters

Volume 129

Pages 63-69

Date 2020

ISSN 0167-8655

DOI https://doi.org/10.1016/j.patrec.2019.11.001

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2020-02-18

Abstract

Details

Actions