Skip to main navigation Skip to search Skip to main content

Dynamic local connectivity and its application to page segmentation

  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Page segmentation is one of the important stage in most document processing systems. Algorithms found in published literatures often rely on some predetermined parameters such as general font sizes, distances between text lines and document scan resolutions. Variations of these parameters in real document images greatly affect the performance of the algorithms. In this paper we present a novel approach for document page segmentation using dynamic local connectivity transform. An efficient implementation of a local connectivity algorithm transforms a document image into a parameter domain in which a parameter value at a pixel location represents a connectivity property for its neighboring foreground pixels in the original document image. Then a top-down approach with a linear search reveals the document regions at each resolution levels as text block, text lines and graphics. We consider our algorithm a transform based multi-resolution method. Our ongoing research shows that the algorithm is robust for variations of document parameters.

Original languageEnglish
Title of host publicationHDP 2004
Subtitle of host publicationProceedings of the First ACM Hardcopy Document Processing Workshop
PublisherAssociation for Computing Machinery (ACM)
Pages47-51
Number of pages5
ISBN (Print)1581139764, 9781581139761
DOIs
StatePublished - 2004
EventHDP 2004: Proceedings of the First ACM Hardcopy Document Processing Workshop - Washington, DC, United States
Duration: Nov 12 2004Nov 12 2004

Publication series

NameHDP 2004: Proceedings of the First ACM Hardcopy Document Processing Workshop

Conference

ConferenceHDP 2004: Proceedings of the First ACM Hardcopy Document Processing Workshop
Country/TerritoryUnited States
CityWashington, DC
Period11/12/0411/12/04

Keywords

  • Character recognition
  • Document image analysis
  • Local connectivity
  • Multi-resolution
  • Page segmentation
  • Region identification

Fingerprint

Dive into the research topics of 'Dynamic local connectivity and its application to page segmentation'. Together they form a unique fingerprint.

Cite this