Skip to main navigation Skip to search Skip to main content

Transcript mapping for handwritten English documents

  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Transcript mapping or text alignment with handwritten documents is the automatic alignment of words in a text file with word images in a handwritten document. Such a mapping has several applications in fields ranging from machine learning where large quantities of truth data are required for evaluating handwriting recognition algorithms, to data mining where word image indexes are used in ranked retrieval of scanned documents in a digital library. The alignment also aids "writer identity" verification algorithms. Interfaces which display scanned handwritten documents may use this alignment to highlight manuscript tokens when a person examines the corresponding transcript word. We propose an adaptation of the True DTW dynamic programming algorithm for English handwritten documents. The integration of the dissimilarity scores from a word-model word recognizer and Levenshtein distance between the recognized word and lexicon word, as a cost metric in the DTW algorithm leading to a fast and accurate alignment, is our primary contribution. Results provided, confirm the effectiveness of our approach.

Original languageEnglish
Title of host publicationDocument Recognition and Retrieval XV
DOIs
StatePublished - 2008
EventDocument Recognition and Retrieval XV - San Jose, CA, United States
Duration: Jan 29 2008Jan 31 2008

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume6815
ISSN (Print)0277-786X

Conference

ConferenceDocument Recognition and Retrieval XV
Country/TerritoryUnited States
CitySan Jose, CA
Period01/29/0801/31/08

Keywords

  • Levenshtein distance
  • Mapping
  • Segmentation
  • Transcript
  • True DTW

Fingerprint

Dive into the research topics of 'Transcript mapping for handwritten English documents'. Together they form a unique fingerprint.

Cite this