Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition

Zi-Rui Wang,Jun Du,Jia-Ming Wang

doi:10.1016/j.patcog.2019.107102

Abstract

Recently, the hybrid convolutional neural network hidden Markov model (CNN-HMM) has been introduced for offline handwritten Chinese text recognition (HCTR) and has achieved state-of-the-art performance. However, modeling each of the large vocabulary of Chinese characters with a uniform and fixed number of hidden states requires high memory and computational costs and makes the tens of thousands of HMM state classes confusing. Another key issue of CNN-HMM for HCTR is the diversified writing style, which leads to model strain and a significant performance decline for specific writers. To address these issues, we propose a writer-aware CNN based on parsimonious HMM (WCNN-PHMM). First, PHMM is designed using a data-driven state-tying algorithm to greatly reduce the total number of HMM states, which not only yields a compact CNN by state sharing of the same or similar radicals among different Chinese characters but also improves the recognition accuracy due to the more accurate modeling of tied states and the lower confusion among them. Second, WCNN integrates each convolutional layer with one adaptive layer fed by a writer-dependent vector, namely, the writer code, to extract the irrelevant variability in writer information to improve recognition performance. The parameters of writer-adaptive layers are jointly optimized with other network parameters in the training stage, while a multiple-pass decoding strategy is adopted to learn the writer code and generate recognition results. Validated on the ICDAR 2013 competition of CASIA-HWDB database, the more compact WCNN-PHMM of a 7360-class vocabulary can achieve a relative character error rate (CER) reduction of 16.6% over the conventional CNN-HMM without considering language modeling. By adopting a powerful hybrid language model (N-gram language model and recurrent neural network language model), the CER of WCNN-PHMM is reduced to 3.17%. Moreover, the state-tying results of PHMM explicitly show the information sharing among similar characters and the confusion reduction of tied state classes. Finally, we visualize the learned writer codes and demonstrate the strong relationship with the writing styles of different writers. To the best of our knowledge, WCNN-PHMM yields the best results on the ICDAR 2013 competition set, demonstrating its power when enlarging the size of the character vocabulary.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Nov 14, 2019
Citations: 44

Similar Papers

Deep Learning Based Handwritten Chinese Character and Text Recognition
Xu-Yao Zhang ... Yi-Chao Wu
-
Xu-Yao Zhang, et. al.Xu-Yao Zhang ... Yi-Chao Wu
01 Jan 2019
01 Jan 2019

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling
Ryo Masumura ... Taichi Asami
-
Ryo Masumura, et. al.Ryo Masumura ... Taichi Asami
01 Dec 2017
01 Dec 2017

A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition
Zi-Rui Wang ... Jun Du
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 21
Zi-Rui Wang, et. al.Zi-Rui Wang ... Jun Du
15 Jun 2018
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 21

Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models
Yi-Chao Wu ... Cheng-Lin Liu
Pattern Recognition | VOL. 65
Yi-Chao Wu, et. al.Yi-Chao Wu ... Cheng-Lin Liu
29 Dec 2016
Pattern Recognition | VOL. 65

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition