Abstract
Compared to typical scanners, handheld cameras offer convenient, flexible, portable, and non-contactimage capture, which enables many new applications and breathes new life into existing ones. However, camera-captured documents may suffer from distortions caused by non-planar document shape and perspective projection, which lead to failure of current OCR technologies. We present a geometric rectification framework for restoring the frontal-flat view of a document from a single camera-captured image. Our approach estimates 3D document shape from texture flow information obtained directly from the image without requiring additional 3D/metric data or prior camera calibration. Our framework provides a unified solution for both planar and curved documents and can be applied in many, especially mobile, camera-based document analysis applications. Experiments show that our method produces results that are significantly more OCR compatible than the original images.
| Original language | English |
|---|---|
| Pages (from-to) | 591-605 |
| Number of pages | 15 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 30 |
| Issue number | 4 |
| DOIs | |
| State | Published - Apr 2008 |
Keywords
- Camera-based OCR
- Image rectification
- Shape estimation
- Texture flow analysis
Fingerprint
Dive into the research topics of 'Geometric rectification of camera-captured document images'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver