A Method of Japanese Ancient Text Recognition by Deep Learning

Lehan Chen,Bing Lyu,Hiroyuki Tomiyama,Lin Meng

doi:10.1016/j.procs.2020.06.084

Abstract

Today, when the importance of the country culture is deeply rooted in the hearts of the people, the protection of ancient books and literature has received more and more attention. In this paper, a method to identify texts in ancient books by deep learning is proposed and ancient book “usonarubesh” is chosen as a dataset to test the performance of the model. In this experiment, the layout of the text is extracted into grayscale image through ARU-Net (a neural pixel labeling machine for historical document layout analysis). At the same time the original image which contains the texts is binarized, which the texts are filled with black, while the backgrounds are filled with white. Each area of text is judged by the density of black pixels and the layouts. The cut texts are then selected as the testing dataset for the trained model of deep learning CNN, AlexNet (the training dataset is ready). Finally, the experimental results are analyzed to draw conclusions and to decide the direction of future work.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Procedia Computer Science	Publication Date: Jan 1, 2020
Citations: 13	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

A Method of Japanese Ancient Text Recognition by Deep Learning

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Similar Papers

Multi-Domain Deep Convolutional Neural Network for Ancient Urdu Text Recognition System
K O Mohammed Aarif ... P Sivakumar
Intelligent Automation & Soft Computing | VOL. 33
K O Mohammed Aarif, et. al.K O Mohammed Aarif ... P Sivakumar
01 Jan 2021
Intelligent Automation & Soft Computing | VOL. 33

Creating Special Literature Resource Databases in Western China Under a Digital Environment
Ji Lu
Bulletin of the American Society for Information Science and Technology | VOL. 29
Ji LuJi Lu
01 Feb 2003
Bulletin of the American Society for Information Science and Technology | VOL. 29

Research on Reconstruction and Cultural Inheritance of Ancient Literature under Digital Expression
Jing Luo
Applied Mathematics and Nonlinear Sciences | VOL. 9
Jing LuoJing Luo
01 Jan 2024
Applied Mathematics and Nonlinear Sciences | VOL. 9

Text-detection and -recognition from natural images

-

10 Feb 2020
10 Feb 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Method of Japanese Ancient Text Recognition by Deep Learning

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science