Skip to main navigation Skip to search Skip to main content

Document image classification and labeling using multiple instance learning

  • University of Maryland, College Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

The labeling of large sets of images for training or testing analysis systems can be a very costly and time-consuming process. Multiple instance learning (MIL) is a generalization of traditional supervised learning which relaxes the need for exact labels on training instances. Instead, the labels are required only for a set of instances known as bags. In this paper, we apply MIL to the retrieval and localization of signatures and the retrieval of images containing machine-printed text, and show that a gain of 15-20% in performance can be achieved over the supervised learning with weak-labeling. We also compare our approach to supervised learning with fully annotated training data and report a competitive accuracy for MIL. Using our experiments on real-world datasets, we show that MIL is a good alternative when the training data has only document-level annotation.

Original languageEnglish
Title of host publicationProceedings - 11th International Conference on Document Analysis and Recognition, ICDAR 2011
Pages1059-1063
Number of pages5
DOIs
StatePublished - 2011
Event11th International Conference on Document Analysis and Recognition, ICDAR 2011 - Beijing, China
Duration: Sep 18 2011Sep 21 2011

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print)1520-5363

Conference

Conference11th International Conference on Document Analysis and Recognition, ICDAR 2011
Country/TerritoryChina
CityBeijing
Period09/18/1109/21/11

Keywords

  • Document Image Labeling
  • Machine-print Documents
  • Signature Detection

Fingerprint

Dive into the research topics of 'Document image classification and labeling using multiple instance learning'. Together they form a unique fingerprint.

Cite this