Skip to main navigation Skip to search Skip to main content

Latent Dirichlet Allocation based Writer Identification in Offline handwriting

  • SUNY Buffalo
  • Hewlett-Packard

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

In this paper, we describe a novel approach to Writer Identification in Offline handwriting using Latent Dirichlet Allocation. State-of-the-art methods for writer identification employ the traditional feature-classification paradigm which does not provide enough information about the handwriting attributes such as writing style which are key components in any forensic analysis of handwriting. This problem is also compounded due to lack of efficient rules for defining a particular writing style that can capture writer specific characteristics over a large dataset. We propose to address this issue by using a generative model in form of Latent Dirichlet Allocation(LDA) that automatically infers writing styles from handwritten document collection without any pre-defined set of rules. This information is then used to represent each writer as a distribution over multiple writing style for classifying any unknown writer sample. We describe our approach on two different feature sets consisting of contour angle features as well as structural and concavity features. Our experimental results show comparable performance with baseline systems and also demonstrate theefficacy of LDA for learning multiple handwriting styles.

Original languageEnglish
Title of host publicationProceedings of the 9th IAPR International Workshop on Document Analysis Systems, DAS '10
Pages357-362
Number of pages6
DOIs
StatePublished - 2010
Event2010 IAPR Workshop on Document Analysis Systems, DAS 2010 - Boston, MA, United States
Duration: Jun 9 2010Jun 11 2010

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2010 IAPR Workshop on Document Analysis Systems, DAS 2010
Country/TerritoryUnited States
CityBoston, MA
Period06/9/1006/11/10

Keywords

  • Handwriting analysis
  • Latent Dirichlet Allocation
  • Topic models
  • Writer Identification

Fingerprint

Dive into the research topics of 'Latent Dirichlet Allocation based Writer Identification in Offline handwriting'. Together they form a unique fingerprint.

Cite this