Skip to main navigation Skip to search Skip to main content

Multi-scale techniques for document page segmentation

  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

17 Scopus citations

Abstract

Page segmentation algorithms found in published literatures often rely on some predetermined parameters such as general font sizes, distances between text lines and document scan resolutions. Variations of these parameters in real document images greatly affect the performance of the algorithms. In this paper we present a novel approach for document page segmentation using a multi-scale technique. An efficient implementation of a local connectivity algorithm transforms a document image into a parameter domain in which a parameter value at a pixel location represents a connectivity property for its neighboring foreground pixels in the original document image. Then a top-down approach with a linear search reveals the document regions at each scale levels as text block, text lines and graphics. We consider our algorithm a transform based multi-scale method. Our ongoing research shows that the algorithm is robust for variations of document parameters.

Original languageEnglish
Title of host publicationProceedings of the Eighth International Conference on Document Analysis and Recognition
Pages1020-1024
Number of pages5
DOIs
StatePublished - 2005
Event8th International Conference on Document Analysis and Recognition - Seoul, Korea, Republic of
Duration: Aug 31 2005Sep 1 2005

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
Volume2005
ISSN (Print)1520-5363

Conference

Conference8th International Conference on Document Analysis and Recognition
Country/TerritoryKorea, Republic of
CitySeoul
Period08/31/0509/1/05

Fingerprint

Dive into the research topics of 'Multi-scale techniques for document page segmentation'. Together they form a unique fingerprint.

Cite this