Skip to main navigation Skip to search Skip to main content

A hierarchical classification model for document categorization

  • Copanion Inc.
  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

We propose a novel hierarchical classification method for documents categorization in this paper. The approach consists of multiple levels of classification for different hierarchies. Regularized Least Square (RLS) binary classifiers are applied in the middle levels of the hierarchy to classify documents into smaller set of categories and K-nearest-neighbor (KNN) multi-class classifiers are used at the bottom to classify documents into final classes. Experiments on large-scale real world tax documents' show that the proposed hierarchical approach outperforms traditional flat classification method.

Original languageEnglish
Title of host publicationICDAR2009 - 10th International Conference on Document Analysis and Recognition
Pages486-490
Number of pages5
DOIs
StatePublished - 2009
EventICDAR2009 - 10th International Conference on Document Analysis and Recognition - Barcelona, Spain
Duration: Jul 26 2009Jul 29 2009

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print)1520-5363

Conference

ConferenceICDAR2009 - 10th International Conference on Document Analysis and Recognition
Country/TerritorySpain
CityBarcelona
Period07/26/0907/29/09

Fingerprint

Dive into the research topics of 'A hierarchical classification model for document categorization'. Together they form a unique fingerprint.

Cite this