
VisualDiff: Document image verification and change detection

  • University of Maryland, College Park

Research output: Contribution to journal › Conference article › peer-review

14 Scopus citations

Abstract

This paper explores the related problems of verification and change detection in document images. The goal is to determine whether two document images differ and, if so, to determine precisely what content has been added, deleted, or otherwise modified. This problem has many potential applications, especially for important legal documents such as contractual agreements. These agreements are often edited, shared, and stored as scanned or hardcopy documents, where small, undetected changes between edits could create major differences in the contractual language and thus have severe repercussions. One can view change detection as tracing the revision history of a set of documents. To validate the approach, we created the 'Enron Revisions' dataset. This dataset contains realistic revisions obtained from attachments in the Enron Corpus, along with a series of before and after snapshots of the revisions as images with varying levels of noise from resolution, binarization, and blur. The approach uses SIFT descriptors to align two document images without relying on OCR and, once the images are aligned, compares dense descriptors to determine what has changed within the image. 'VisualDiff' is compared against a baseline that applies a UNIX diff-like approach to text extracted through OCR, and the results demonstrate the effectiveness of the visual approach.
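For intuition about the text-based baseline mentioned in the abstract, the following is a minimal sketch of a diff-like comparison over OCR output. It is not the authors' implementation: it assumes the OCR text of each page is already available as a plain string, and the function names `word_diff` and `documents_differ` are hypothetical. It uses Python's standard `difflib` module in place of the UNIX `diff` tool and compares at the word level.

```python
import difflib


def word_diff(before_text, after_text):
    """Return (operation, before_words, after_words) for each changed span.

    Assumption: both arguments are plain strings already produced by OCR.
    Operations are difflib's 'replace', 'delete', and 'insert'.
    """
    before = before_text.split()
    after = after_text.split()
    # autojunk=False disables difflib's popularity heuristic so that
    # frequent words in long documents are still matched.
    matcher = difflib.SequenceMatcher(a=before, b=after, autojunk=False)
    changes = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            changes.append((tag, before[i1:i2], after[j1:j2]))
    return changes


def documents_differ(before_text, after_text):
    """Verification step: True if any word-level change is detected."""
    return bool(word_diff(before_text, after_text))


if __name__ == "__main__":
    # Hypothetical contract sentence before and after a revision.
    before = "The buyer shall pay within 30 days of delivery."
    after = "The buyer shall pay within 90 days of final delivery."
    for op, old, new in word_diff(before, after):
        print(op, old, new)
```

A baseline of this kind depends entirely on OCR quality: a misrecognized character shows up as a spurious replacement, which is the sort of weakness that an image-level comparison avoids by not relying on OCR.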

Original language: English
Article number: 6628582
Pages (from-to): 40-44
Number of pages: 5
Journal: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
State: Published - 2013
Event: 12th International Conference on Document Analysis and Recognition, ICDAR 2013 - Washington, DC, United States
Duration: Aug 25 2013 → Aug 28 2013

Keywords

  • Change Detection
  • Document Image
  • Document Verification
