Skip to main navigation Skip to search Skip to main content

Geometric rectification of camera-captured document images

  • Amazon.com, Inc.
  • University of Maryland, College Park

Research output: Contribution to journalArticlepeer-review

154 Scopus citations

Abstract

Compared to typical scanners, handheld cameras offer convenient, flexible, portable, and non-contactimage capture, which enables many new applications and breathes new life into existing ones. However, camera-captured documents may suffer from distortions caused by non-planar document shape and perspective projection, which lead to failure of current OCR technologies. We present a geometric rectification framework for restoring the frontal-flat view of a document from a single camera-captured image. Our approach estimates 3D document shape from texture flow information obtained directly from the image without requiring additional 3D/metric data or prior camera calibration. Our framework provides a unified solution for both planar and curved documents and can be applied in many, especially mobile, camera-based document analysis applications. Experiments show that our method produces results that are significantly more OCR compatible than the original images.

Original languageEnglish
Pages (from-to)591-605
Number of pages15
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume30
Issue number4
DOIs
StatePublished - Apr 2008

Keywords

  • Camera-based OCR
  • Image rectification
  • Shape estimation
  • Texture flow analysis

Fingerprint

Dive into the research topics of 'Geometric rectification of camera-captured document images'. Together they form a unique fingerprint.

Cite this