TY - GEN
T1 - Latent Dirichlet Allocation based Writer Identification in Offline handwriting
AU - Bhardwaj, Anurag
AU - Reddy, Manavender
AU - Setlur, Srirangaraj
AU - Govindaraju, Venu
AU - Ramachandrula, Sitaram
PY - 2010
Y1 - 2010
N2 - In this paper, we describe a novel approach to Writer Identification in Offline handwriting using Latent Dirichlet Allocation. State-of-the-art methods for writer identification employ the traditional feature-classification paradigm which does not provide enough information about the handwriting attributes such as writing style which are key components in any forensic analysis of handwriting. This problem is also compounded due to lack of efficient rules for defining a particular writing style that can capture writer specific characteristics over a large dataset. We propose to address this issue by using a generative model in form of Latent Dirichlet Allocation(LDA) that automatically infers writing styles from handwritten document collection without any pre-defined set of rules. This information is then used to represent each writer as a distribution over multiple writing style for classifying any unknown writer sample. We describe our approach on two different feature sets consisting of contour angle features as well as structural and concavity features. Our experimental results show comparable performance with baseline systems and also demonstrate theefficacy of LDA for learning multiple handwriting styles.
AB - In this paper, we describe a novel approach to Writer Identification in Offline handwriting using Latent Dirichlet Allocation. State-of-the-art methods for writer identification employ the traditional feature-classification paradigm which does not provide enough information about the handwriting attributes such as writing style which are key components in any forensic analysis of handwriting. This problem is also compounded due to lack of efficient rules for defining a particular writing style that can capture writer specific characteristics over a large dataset. We propose to address this issue by using a generative model in form of Latent Dirichlet Allocation(LDA) that automatically infers writing styles from handwritten document collection without any pre-defined set of rules. This information is then used to represent each writer as a distribution over multiple writing style for classifying any unknown writer sample. We describe our approach on two different feature sets consisting of contour angle features as well as structural and concavity features. Our experimental results show comparable performance with baseline systems and also demonstrate theefficacy of LDA for learning multiple handwriting styles.
KW - Handwriting analysis
KW - Latent Dirichlet Allocation
KW - Topic models
KW - Writer Identification
UR - https://www.scopus.com/pages/publications/77955006947
U2 - 10.1145/1815330.1815376
DO - 10.1145/1815330.1815376
M3 - Conference contribution
AN - SCOPUS:77955006947
SN - 9781605587738
T3 - ACM International Conference Proceeding Series
SP - 357
EP - 362
BT - Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, DAS '10
T2 - 2010 IAPR Workshop on Document Analysis Systems, DAS 2010
Y2 - 9 June 2010 through 11 June 2010
ER -