Skip to main navigation Skip to search Skip to main content

Image enhancement for degraded binary document images

  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

This paper presents a novel set of image enhancement algorithms for binary images of poorly scanned real world page documents. Problems that are targeted by the methods described include large blobs or clutter noise, salt-and-pepper noise and detection and removal of non-text objects such as form lines or rule-lines. The algorithms described are shown to be very effective in removing clutter noise and pepper noise as well as form lines and rule-lines. A region growing algorithm is also described to enhance the quality of the text and to fix the problems arising from the salt noise which leaves holes in the text and creates broken strokes. The methods were tested on 204 images from the challenge set of the DARPA MADCAT Arabic handwritten document image data. The results indicate that the methods described are robust and are capable of significantly improving the image quality for downstream OCR systems.

Original languageEnglish
Title of host publicationProceedings - 11th International Conference on Document Analysis and Recognition, ICDAR 2011
Pages895-899
Number of pages5
DOIs
StatePublished - 2011
Event11th International Conference on Document Analysis and Recognition, ICDAR 2011 - Beijing, China
Duration: Sep 18 2011Sep 21 2011

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print)1520-5363

Conference

Conference11th International Conference on Document Analysis and Recognition, ICDAR 2011
Country/TerritoryChina
CityBeijing
Period09/18/1109/21/11

Fingerprint

Dive into the research topics of 'Image enhancement for degraded binary document images'. Together they form a unique fingerprint.

Cite this