Seam Carving for Text Line Extraction on Color and Grayscale Historical Manuscripts

We propose a novel algorithm for automatic text line extraction on color and grayscale manuscript pages without prior binarization. Our algorithm uses seam carving to compute separating seams between text lines. If no information about the text geometry is incorporated into the problem, seam carving is likely to produce seams that move through gaps between neighboring lines. By constraining the optimization procedure to the region between two consecutive text lines, we can produce robust separating seams that do not cut through word and line components. Extensive experimental evaluations on diverse manuscript pages show that we improve upon the state of the art for grayscale text line extraction.
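
The idea of restricting a seam to the region between two consecutive text lines can be illustrated with a small dynamic-programming sketch. This is not the authors' implementation: the function name `constrained_seam`, the per-column bounds `upper` and `lower`, and the precomputed `energy` map are all assumptions made for illustration, and the abstract does not say how the energy or the bounds are obtained.

```python
import numpy as np

def constrained_seam(energy, upper, lower):
    """Minimum-energy left-to-right seam with upper[x] <= row <= lower[x].

    energy: 2D array (rows, cols), hypothetical per-pixel cost.
    upper, lower: per-column row bounds (hypothetical region between
    two consecutive text lines). Returns one row index per column.
    """
    h, w = energy.shape
    rows = np.arange(h)
    upper = np.broadcast_to(np.asarray(upper), (w,))
    lower = np.broadcast_to(np.asarray(lower), (w,))

    # Pixels outside the allowed band get infinite cost, so no seam uses them.
    allowed = (rows[:, None] >= upper[None, :]) & (rows[:, None] <= lower[None, :])
    cost = np.where(allowed, energy.astype(float), np.inf)
    back = np.zeros((h, w), dtype=int)  # row offset (-1, 0, +1) to predecessor

    for x in range(1, w):
        prev = cost[:, x - 1]
        from_above = np.concatenate(([np.inf], prev[:-1]))  # predecessor at row y-1
        from_below = np.concatenate((prev[1:], [np.inf]))   # predecessor at row y+1
        stacked = np.stack((from_above, prev, from_below))
        choice = np.argmin(stacked, axis=0)
        back[:, x] = choice - 1
        cost[:, x] += stacked[choice, rows]

    # Backtrack from the cheapest end point in the last column.
    seam = np.empty(w, dtype=int)
    seam[-1] = int(np.argmin(cost[:, -1]))
    for x in range(w - 1, 0, -1):
        seam[x - 1] = seam[x] + back[seam[x], x]
    return seam

# Toy example: row 4 is cheapest overall, but the band only allows rows 1..3.
energy = np.ones((5, 6))
energy[2, :] = 0.5
energy[4, :] = 0.0
seam = constrained_seam(energy, upper=1, lower=3)
```

In this toy case an unconstrained search would follow the zero-energy row 4, whereas the band keeps the seam inside rows 1 to 3, which mirrors the motivation given above for constraining the optimization.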

Presented at:
14th International Conference on Frontiers in Handwriting Recognition (ICFHR), Crete, Greece, September 1-4, 2014

 Record created 2014-05-13, last modified 2018-03-17
