Skip to main navigation Skip to search Skip to main content

Overlapped text segmentation using markov random field and aggregation

  • SUNY Buffalo
  • Hewlett-Packard

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

Separating machine printed text and handwriting from overlapping text is a challenging problem in the document analysis field and no reliable algorithms have been developed thus far. In this paper, we propose a novel approach for separating handwriting from binary image of overlapped text. Instead of using fixed size training patches, we describe an aggregation method which uses shape context features to extract training samples automatically. We use a Markov Random Field (MRF) to model the over-lapped text. The neighbor system is inherited from a coarsening procedure and the prior and likelihood of the MRF is learned based on a distance metric. Experimental results show that the proposed method can achieve 87.97% recall for handwriting and 91.44% recall for machine printed text.

Original languageEnglish
Title of host publicationProceedings of the 9th IAPR International Workshop on Document Analysis Systems, DAS '10
Pages129-134
Number of pages6
DOIs
StatePublished - 2010
Event2010 IAPR Workshop on Document Analysis Systems, DAS 2010 - Boston, MA, United States
Duration: Jun 9 2010Jun 11 2010

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2010 IAPR Workshop on Document Analysis Systems, DAS 2010
Country/TerritoryUnited States
CityBoston, MA
Period06/9/1006/11/10

Fingerprint

Dive into the research topics of 'Overlapped text segmentation using markov random field and aggregation'. Together they form a unique fingerprint.

Cite this