Skip to main navigation Skip to search Skip to main content

Handwritten Arabic text line segmentation using Affinity propagation

  • University of Maryland, College Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

55 Scopus citations

Abstract

In this paper, we present a novel graph-based method for extracting handwritten text lines in monochromatic Ara- bic document images. Our approach consists of two steps - Coarse text line estimation using primary components which define the line and assignment of diacritic components which are more difficult to associate with a given line. We first esti- mate local orientation at each primary component to build a sparse similarity graph. We then, use a shortest path algorithm to compute similarities between non-neighboring components. From this graph, we obtain coarse text lines using two estimates obtained from Affinity propagation and Breadth-first search. In the second step, we assign secondary components to each text line. The proposed method is very fast and robust to non-uniform skew and character size variations, normally present in handwritten text lines. We evaluate our method using a pixel-matching criteria, and report 96% accuracy on a dataset of 125 Arabic document images. We also present a proximity analysis on datasets generated by artificially decreasing the spacings between text lines to demonstrate the robustness of our approach.

Original languageEnglish
Title of host publicationProceedings of the 9th IAPR International Workshop on Document Analysis Systems, DAS '10
Pages135-142
Number of pages8
DOIs
StatePublished - 2010
Event2010 IAPR Workshop on Document Analysis Systems, DAS 2010 - Boston, MA, United States
Duration: Jun 9 2010Jun 11 2010

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2010 IAPR Workshop on Document Analysis Systems, DAS 2010
Country/TerritoryUnited States
CityBoston, MA
Period06/9/1006/11/10

Keywords

  • Arabic documents
  • Handwritten documents
  • Text line segmentation

Fingerprint

Dive into the research topics of 'Handwritten Arabic text line segmentation using Affinity propagation'. Together they form a unique fingerprint.

Cite this