Skip to main navigation Skip to search Skip to main content

Script independentword spotting in offline handwritten documents based on Hidden Markov Models

  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

41 Scopus citations

Abstract

Keyword spotting aims to retrieve all instances of a given keyword from a document in any language. In this paper, we propose a novel script independent line based word spotting framework for offline handwritten documents based on Hidden Markov Models. The methodology simulates the keywords in model space as a sequence of character models and uses the filler models for better representation of background or non-keyword text. We propose a two stage spotting framework where the candidate keywords are further pruned using the character based background and lexicon based background model. The system deals with large vocabulary without the need for word or character segmentation. The system has been evaluated on many public dataset from several languages such as IAM for English, AMA for Arabic and LAW for Devanagari. The system outperforms the modern line based approach on the English, Arabic and Devanagari Datasets.

Original languageEnglish
Title of host publicationProceedings - 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
Pages14-19
Number of pages6
DOIs
StatePublished - 2012
Event13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012 - Bari, Italy
Duration: Sep 18 2012Sep 20 2012

Publication series

NameProceedings - International Workshop on Frontiers in Handwriting Recognition, IWFHR
ISSN (Print)1550-5235

Conference

Conference13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
Country/TerritoryItaly
CityBari
Period09/18/1209/20/12

Fingerprint

Dive into the research topics of 'Script independentword spotting in offline handwritten documents based on Hidden Markov Models'. Together they form a unique fingerprint.

Cite this