Skip to main navigation Skip to search Skip to main content

Generalizing edit distance for handwritten text recognition

  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

In this paper the Damerau-Levenshtein string difference metric is generalized in two ways to more accurately compensate for the types of errors that are present in the script recognition domain. First, the basic dynamic programming method for computing such a measure is extended to allow for merges, splits and two-letter substitutions. Second, edit operations are refined into categories according to the effect they have on the visual `appearance' of words. A set of recognizer-independent constraints is developed to reflect the severity of the information lost due to each operation. These constraints are solved to assign specific costs to the operations. Experimental results on 2,335 corrupted strings and a lexicon of 21,299 words show higher correcting rates than with the original form.

Original languageEnglish
Title of host publicationProceedings of SPIE - The International Society for Optical Engineering
Pages54-65
Number of pages12
StatePublished - 1995

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume2422
ISSN (Print)0277-786X

Fingerprint

Dive into the research topics of 'Generalizing edit distance for handwritten text recognition'. Together they form a unique fingerprint.

Cite this