Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models

Vinciarelli, Alessandro; Bengio, Samy; Bunke, Horst

doi:10.1109/TPAMI.2004.14

Vinciarelli, Alessandro; Bengio, Samy; Bunke, Horst

2004

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

This paper presents a system for the offline recognition of large vocabulary unconstrained handwritten texts. The only assumption made about the data is that it is written in English. This allows the application of Statistical Language Models in order to improve the performance of our system. Several experiments have been performed using both single and multiple writer data. Lexica of variable size (from 10,000 to 50,000 words) have been used. The use of language models is shown to improve the accuracy of the system (when the lexicon contains 50,000 words, error rate is reduced by 50% for single writer data and by 25% for multiple writer data). Our approach is described in detail and compared with other methods presented in the literature to deal with the same problem. An experimental setup to correctly deal with unconstrained text recognition is proposed.

Details

Title Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models

Author(s) Vinciarelli, Alessandro ; Bengio, Samy ; Bunke, Horst

Published in IEEE Transactions on Pattern Analysis and Machine Intelligence

Volume 26

Issue 6

Pages 709-720

Date 2004

Keywords

vision

Note IDIAP-RR 03-22

DOI https://doi.org/10.1109/TPAMI.2004.14

Additional link URL; Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2006-03-10

Actions

Preview

Select file: