Abstract
The use of content features extracted from recognized text is valuable in labeling logical elements in documents without rigid layout structure, like business letters. This paper discusses a model-based approach to combining content features with other geometrical and presentation features for logical labeling. Models are automatically initialized and adaptively improved using training samples. Satisfactory experimental results are presented.
| Original language | English |
|---|---|
| Pages (from-to) | 189-196 |
| Number of pages | 8 |
| Journal | Proceedings of SPIE - The International Society for Optical Engineering |
| Volume | 5010 |
| DOIs | |
| State | Published - 2003 |
| Event | Document Recognition and Retrieval X - Santa Clara, CA, United States Duration: Jan 22 2003 → Jan 24 2003 |
Keywords
- Content feature
- Logical document labeling
Fingerprint
Dive into the research topics of 'Content features for logical document labeling'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver