Recognition of Japanese historical text lines by an attention-based encoder-decoder and text line generation

Anh Duc Le,Daichi Mochihashi,Nam Tuan Ly,Katsuya Masuda,Hideki Mima

doi:10.1145/3352631.3352641

Abstract

Inspired by the recent successes of attention based encoder-decoder (AED) approach on image captioning, machine translation, we present an AED model as an end-to-end recognition system for recognizing Japanese historical documents. The recognition system has two main modules: a dense convolution neural network for extracting features, and a Long Shor Term Memory (LSTM) decoder integrating with attention model for generating target text. We can train the model end-to-end. The model requires only input text line images and corresponding output characters. Therefore, we don't need annotations for characters and save a lot of time for making annotations. We also present a method to generate artificial text lines to solve the imbalance problem of the current annotated database. The results of experiments on the annotated and artificial databases demonstrate the effectiveness of the text line generation. Our recognition system achieved Character Error Rate of 23.76% and 22.52% by training with and without artificial text lines, respectively. Moreover, our recognition system outperforms the CNN-LSTM system, which achieved the state-of-art results in other document recognition tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recognition of Japanese historical text lines by an attention-based encoder-decoder and text line generation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data
Ye Bai ... Jiangyan Yi
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Ye Bai, et. al.Ye Bai ... Jiangyan Yi
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

Speaker Adaptation for Attention-Based End-to-End Speech Recognition
Zhong Meng ... Yashesh Gaur
-
Zhong Meng, et. al.Zhong Meng ... Yashesh Gaur
15 Sep 2019
15 Sep 2019

Character-Aware Attention-Based End-to-End Speech Recognition
Zhong Meng ... Yashesh Gaur
-
Zhong Meng, et. al.Zhong Meng ... Yashesh Gaur
01 Dec 2019
01 Dec 2019

Attention-Based Personalized Encoder-Decoder Model for Local Citation Recommendation.
Libin Yang ... Tao Dai
Computational Intelligence and Neuroscience | VOL. 2019
Libin Yang, et. al.Libin Yang ... Tao Dai
03 Jun 2019
Computational Intelligence and Neuroscience | VOL. 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recognition of Japanese historical text lines by an attention-based encoder-decoder and text line generation

Abstract

Talk to us

Similar Papers