Dense and Tight Detection of Chinese Characters in Historical Documents: Datasets and a Recognition Guided Detector

Hailin Yang, Songxuan Lai, Lianwen Jin, Jifeng Sun, Weiguo Huang, Zhaoyang Yang

doi:10.1109/access.2018.2840218

Open Access

Journal: IEEE Access
Publication Date: Jan 1, 2018
Citations: 68
License type: cc-by-nc-nd

Affiliation: South China University of Technology

Abstract

Characters in historical documents are typically densely distributed, which makes them difficult to localize and segment by directly applying classic proposal-based and regression-based methods. In this paper, we propose a novel method called the recognition guided detector (RGD) that achieves tight Chinese character detection in historical documents. The proposed RGD consists of two simultaneously trained convolutional neural networks: a recognition guided proposal network that provides context information about the text, and a detection network that uses this information to localize each character accurately. To train and test the proposed method, we established two new datasets with character-level annotations, comprising ground truth character bounding boxes and the ground truth character in each box. The data in our datasets are scanned images collected from nine different versions of Tripitaka in Han. Experimental results show that, guided by a text recognition network with a test accuracy of 97.25%, the detection network in our proposed method achieves a much higher F-score with fewer parameters than several state-of-the-art object detection and text detection methods under a highly constrained evaluation criterion of intersection over union (IoU) ≥ 0.7. The datasets are publicly available at https://github.com/HCIILAB/TKH_MTH_Datasets_Release for non-commercial use.
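
To make the IoU ≥ 0.7 criterion concrete, the following is a minimal sketch of how a detection F-score could be computed from predicted and ground truth boxes. The greedy one-to-one matching, the `(x1, y1, x2, y2)` box format, and the function names `iou` and `detection_f_score` are assumptions made for illustration; the abstract does not specify the paper's exact matching protocol.

```python
from typing import List, Tuple

# Assumed box format: (x1, y1, x2, y2) with x2 > x1 and y2 > y1.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def detection_f_score(
    preds: List[Box], gts: List[Box], thr: float = 0.7
) -> Tuple[float, float, float]:
    """Return (precision, recall, F-score) under an IoU threshold.

    Assumption: each prediction is greedily matched to the unmatched
    ground truth box with the highest IoU, and counts as a true
    positive only if that IoU is at least `thr`.
    """
    matched = set()
    tp = 0
    for p in preds:
        best_j, best_iou = -1, 0.0
        for j, g in enumerate(gts):
            if j in matched:
                continue
            v = iou(p, g)
            if v > best_iou:
                best_j, best_iou = j, v
        if best_j >= 0 and best_iou >= thr:
            matched.add(best_j)
            tp += 1
    precision = tp / len(preds) if preds else 0.0
    recall = tp / len(gts) if gts else 0.0
    denom = precision + recall
    f = 2 * precision * recall / denom if denom > 0 else 0.0
    return precision, recall, f
```

With a threshold of 0.7, a predicted box shifted by half a character width (IoU of about 0.33) is rejected, which is why a tight criterion is demanding for densely packed characters.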
