Abstract
The task of handwritten Chinese character recognition is one of the most challenging areas of human handwriting classification. The main reason for this is related to the writing system itself which encompasses thousands of characters, coupled with high levels of diversity in personal writing styles and attributes. Much of the existing work for both online and off-line handwritten Chinese character recognition has focused on methods which employ feature extraction and segmentation steps. The preprocessed data from these steps form the basis for the subsequent classification and recognition phases. This paper proposes an approach for handwritten Chinese character recognition and classification using only an image alignment technique and does not require the aforementioned steps. Rather than extracting features from the image, which often means building models from very large training data, the proposed method instead uses the mean image transformations as a basis for model building. The use of an image-only model means that no subjective tuning of the feature extraction is required. In addition by employing a fuzzy-entropy-based metric, the work also entails improved ability to model different types of uncertainty. The classifier is a simple distance-based nearest neighbour classification system based on template matching. The approach is applied to a publicly available real-world database of handwritten Chinese characters and demonstrates that it can achieve high classification accuracy and is robust in the presence of noise.
Highlights
Handwritten Chinese character recognition (HCCR) has been the topic of much interest, in both research circles and for practical applications
The task of HCCR where the domain is non-constrained is still a problem that poses significant challenges (Du et al 2014)
This section details the experiments conducted and the results obtained for the novel HCCR approach using fuzzy-entropy congealing algorithm
Summary
Handwritten Chinese character recognition (HCCR) has been the topic of much interest, in both research circles and for practical applications. The task of HCCR where the domain is non-constrained (in which the uncertainty is not limited to particular training examples of handwritten characters) is still a problem that poses significant challenges (Du et al 2014) This is mainly due to the high levels of diversity of both handwriting styles and large character sets. In most existing approaches to HCCR, a series of steps aimed at extracting features or feature information from images is performed (Du et al 2014; Liu et al 2004) Such approaches typically rely on very large training datasets (in the order of thousands of training examples) (Liu et al 2004; Shao et al 2014), resulting in models of considerable size and complexity. An alternative approach is presented which does not require any feature extraction step Instead, it relies upon an image alignment technique to bring various examples of characters into correspondence.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have