Skip to main navigation Skip to search Skip to main content

Content features for logical document labeling

  • University of Maryland, College Park

Research output: Contribution to journalConference articlepeer-review

7 Scopus citations

Abstract

The use of content features extracted from recognized text is valuable in labeling logical elements in documents without rigid layout structure, like business letters. This paper discusses a model-based approach to combining content features with other geometrical and presentation features for logical labeling. Models are automatically initialized and adaptively improved using training samples. Satisfactory experimental results are presented.

Original languageEnglish
Pages (from-to)189-196
Number of pages8
JournalProceedings of SPIE - The International Society for Optical Engineering
Volume5010
DOIs
StatePublished - 2003
EventDocument Recognition and Retrieval X - Santa Clara, CA, United States
Duration: Jan 22 2003Jan 24 2003

Keywords

  • Content feature
  • Logical document labeling

Fingerprint

Dive into the research topics of 'Content features for logical document labeling'. Together they form a unique fingerprint.

Cite this