TY - GEN
T1 - Multilingual word spotting in offline handwritten documents
AU - Wshah, Safwan
AU - Kumar, Gaurav
AU - Govindaraju, Venu
PY - 2012
Y1 - 2012
N2 - In this work, we propose a novel multilingual word spotting framework based on Hidden Markov Models that works on a corpus of multilingual handwritten documents and on documents that contain more than one handwritten script. The system handles large multilingual vocabularies without the need for word or character segmentation. A keyword is represented by concatenating its character models. We propose and compare two systems: a script-identifier-based (IDB) system and a script-identifier-free (IDF) system. IDB applies an HMM-based script identifier before spotting a keyword, whereas IDF performs spotting without script identification. The system is evaluated on a mixed corpus of public datasets from several scripts, such as IAM for English, AMA for Arabic, and LAW for Devanagari, and on a synthetic dataset generated by concatenating words and lines from different scripts in a document image.
AB - In this work, we propose a novel multilingual word spotting framework based on Hidden Markov Models that works on a corpus of multilingual handwritten documents and on documents that contain more than one handwritten script. The system handles large multilingual vocabularies without the need for word or character segmentation. A keyword is represented by concatenating its character models. We propose and compare two systems: a script-identifier-based (IDB) system and a script-identifier-free (IDF) system. IDB applies an HMM-based script identifier before spotting a keyword, whereas IDF performs spotting without script identification. The system is evaluated on a mixed corpus of public datasets from several scripts, such as IAM for English, AMA for Arabic, and LAW for Devanagari, and on a synthetic dataset generated by concatenating words and lines from different scripts in a document image.
UR - https://www.scopus.com/pages/publications/84874559219
M3 - Conference contribution
AN - SCOPUS:84874559219
SN - 9784990644109
T3 - Proceedings - International Conference on Pattern Recognition
SP - 310
EP - 313
BT - ICPR 2012 - 21st International Conference on Pattern Recognition
T2 - 21st International Conference on Pattern Recognition, ICPR 2012
Y2 - 11 November 2012 through 15 November 2012
ER -