TY - GEN
T1 - Knowledge transfer using neural network based approach for handwritten text recognition
AU - Nair, Rathin Radhakrishnan
AU - Sankaran, Nishant
AU - Kota, Bharagava Urala
AU - Tulyakov, Sergey
AU - Setlur, Srirangaraj
AU - Govindaraju, Venu
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/6/22
Y1 - 2018/6/22
N2 - The goal of a writer adaptive handwriting recognition system is to build a model that improves the recognition of a generic recognition model for a specific author. In this work, we show how structural representation learned from a generic writer-independent handwriting recognition model can be customized to individual authors. Convolutional Neural Network has shown outstanding performance in learning image-based representation that was used for classification. Additionally, they have been used along with Recurrent Neural Network (RNN) or its variations like, LSTM and GRU layers to analyze and understand sequences in handwriting recognition, sentence analysis, voice recognition etc. In most cases, the CNNs serve as a feature extractor instead of low-level hand-designed features that were used previously for the above-mentioned classification tasks. We design a method to reuse weights from layers trained on the IAM offline handwritten dataset to compute mid-level image representation for text in the Washington and Moore dataset. We show that despite differences in the writing style, fonts across these datasets, the transferred representation is able to capture a spatio-Temporal representation leading to significantly improved recognition results. We hypothesize that the performance is solely not dependent on the number of samples and the model is evaluated with varying amount of fine-Tuning samples showing promising results backing the hypothesis.
AB - The goal of a writer adaptive handwriting recognition system is to build a model that improves the recognition of a generic recognition model for a specific author. In this work, we show how structural representation learned from a generic writer-independent handwriting recognition model can be customized to individual authors. Convolutional Neural Network has shown outstanding performance in learning image-based representation that was used for classification. Additionally, they have been used along with Recurrent Neural Network (RNN) or its variations like, LSTM and GRU layers to analyze and understand sequences in handwriting recognition, sentence analysis, voice recognition etc. In most cases, the CNNs serve as a feature extractor instead of low-level hand-designed features that were used previously for the above-mentioned classification tasks. We design a method to reuse weights from layers trained on the IAM offline handwritten dataset to compute mid-level image representation for text in the Washington and Moore dataset. We show that despite differences in the writing style, fonts across these datasets, the transferred representation is able to capture a spatio-Temporal representation leading to significantly improved recognition results. We hypothesize that the performance is solely not dependent on the number of samples and the model is evaluated with varying amount of fine-Tuning samples showing promising results backing the hypothesis.
KW - Adaptive recognition
KW - Cnn
KW - Handwriting recognition
KW - Lstm
KW - Transfer learning
UR - https://www.scopus.com/pages/publications/85050277969
U2 - 10.1109/DAS.2018.75
DO - 10.1109/DAS.2018.75
M3 - Conference contribution
AN - SCOPUS:85050277969
T3 - Proceedings - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018
SP - 441
EP - 446
BT - Proceedings - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018
Y2 - 24 April 2018 through 27 April 2018
ER -