A Handwritten French Dataset for Word Spotting - CFRAMUZ

We present a new and freely available dataset, CFRAMUZ, for segmentation-free word spotting research. The dataset consists of seven novels with a total number of 64 pages and 18000 words written in french by the Swiss writer C.F. Ramuz. The novels cover the writer’s whole period of life, therefore they show changes in the handwriting style. Together with the complete ground-truth of the dataset we provide an annotation tool. We provide evaluations of state-of-the-art word spotting approaches on this dataset. For completeness we also compare all the approaches on other commonly used datasets to demonstrate the new difficulties and challenges our new dataset introduces.


Presented at:
The 4th International Workshop on Historical Document Imaging and Processing (HIP 2017), Kyoto, Japan, November 10-11, 2017
Year:
2017
Keywords:
Laboratories:




 Record created 2017-10-31, last modified 2018-12-03

n/a:
Download fulltext
PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)