Pre-processing methods for handwritten Arabic documents

  • SUNY Buffalo
  • IBM

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

78 Scopus citations

Abstract

Preprocessing is essential for improving both the readability and the automatic recognition of handwritten document images. Beyond the conventional steps of noise removal and filtering, it includes text normalization such as baseline correction, slant normalization, and skew correction. These steps make feature extraction more reliable and effective. Arabic handwriting recognition has recently received some attention from the research community, but because of the unique nature of the script, conventional methods do not prove effective. In this work, we describe an orientation-independent technique for baseline detection of Arabic words. We also describe our techniques for slant normalization, slope correction, and line and word separation in handwritten Arabic documents. We show how the baseline can be exploited for slope and skew correction before proceeding with line and word separation.

Original language: English
Title of host publication: Proceedings of the Eighth International Conference on Document Analysis and Recognition
Pages: 267-271
Number of pages: 5
State: Published - 2005
Event: 8th International Conference on Document Analysis and Recognition - Seoul, Korea, Republic of
Duration: Aug 31 2005 to Sep 1 2005

Publication series

Name: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
Volume: 2005
ISSN (Print): 1520-5363

Conference

Conference: 8th International Conference on Document Analysis and Recognition
Country/Territory: Korea, Republic of
City: Seoul
Period: 08/31/05 to 09/01/05
