TY - GEN
T1 - Script independent word spotting in offline handwritten documents based on Hidden Markov Models
AU - Wshah, Safwan
AU - Kumar, Gaurav
AU - Govindaraju, Venu
PY - 2012
Y1 - 2012
N2 - Keyword spotting aims to retrieve all instances of a given keyword from a document in any language. In this paper, we propose a novel script-independent, line-based word spotting framework for offline handwritten documents based on Hidden Markov Models. The methodology simulates the keywords in model space as a sequence of character models and uses filler models for better representation of background or non-keyword text. We propose a two-stage spotting framework in which the candidate keywords are further pruned using character-based and lexicon-based background models. The system handles large vocabularies without the need for word or character segmentation. The system has been evaluated on several public datasets in different languages, including IAM for English, AMA for Arabic and LAW for Devanagari. The system outperforms the modern line-based approach on the English, Arabic and Devanagari datasets.
AB - Keyword spotting aims to retrieve all instances of a given keyword from a document in any language. In this paper, we propose a novel script-independent, line-based word spotting framework for offline handwritten documents based on Hidden Markov Models. The methodology simulates the keywords in model space as a sequence of character models and uses filler models for better representation of background or non-keyword text. We propose a two-stage spotting framework in which the candidate keywords are further pruned using character-based and lexicon-based background models. The system handles large vocabularies without the need for word or character segmentation. The system has been evaluated on several public datasets in different languages, including IAM for English, AMA for Arabic and LAW for Devanagari. The system outperforms the modern line-based approach on the English, Arabic and Devanagari datasets.
UR - https://www.scopus.com/pages/publications/84874260261
U2 - 10.1109/ICFHR.2012.264
DO - 10.1109/ICFHR.2012.264
M3 - Conference contribution
AN - SCOPUS:84874260261
SN - 9780769547749
T3 - Proceedings - International Workshop on Frontiers in Handwriting Recognition, IWFHR
SP - 14
EP - 19
BT - Proceedings - 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
T2 - 13th International Conference on Frontiers in Handwriting Recognition, ICFHR 2012
Y2 - 18 September 2012 through 20 September 2012
ER -