Skip to main navigation Skip to search Skip to main content

Learning visual shape lexicon for document image content recognition

  • University of Maryland, College Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories - pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.

Original languageEnglish
Title of host publicationComputer Vision - ECCV 2008 - 10th European Conference on Computer Vision, Proceedings
PublisherSpringer Verlag
Pages745-758
Number of pages14
EditionPART 2
ISBN (Print)3540886850, 9783540886853
DOIs
StatePublished - 2008
Event10th European Conference on Computer Vision, ECCV 2008 - Marseille, France
Duration: Oct 12 2008Oct 18 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume5303 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th European Conference on Computer Vision, ECCV 2008
Country/TerritoryFrance
CityMarseille
Period10/12/0810/18/08

Fingerprint

Dive into the research topics of 'Learning visual shape lexicon for document image content recognition'. Together they form a unique fingerprint.

Cite this