An improved CRNN for Vietnamese Identity Card Information Recognition

Trinh Tan Dat,Nguyen Nhat Truong,Vu Ngoc Thanh Sang,Pham The Bao,Le Tran Anh Dang,Pham Cung Le Thien Vu,Pham Thi Vuong

doi:10.32604/csse.2022.019064

Trinh Tan Dat, Nguyen Nhat Truong + Show 5 more

Open Access

https://doi.org/10.32604/csse.2022.019064

Copy DOI

Journal: Computer Systems Science and Engineering	Publication Date: Jan 1, 2022
Citations: 3	License type: cc-by

Affiliation: Saigon University

Abstract

This paper proposes an enhancement of an automatic text recognition system for extracting information from the front side of the Vietnamese citizen identity (CID) card. First, we apply Mask-RCNN to segment and align the CID card from the background. Next, we present two approaches to detect the CID card’s text lines using traditional image processing techniques compared to the EAST detector. Finally, we introduce a new end-to-end Convolutional Recurrent Neural Network (CRNN) model based on a combination of Connectionist Temporal Classification (CTC) and attention mechanism for Vietnamese text recognition by jointly train the CTC and attention objective functions together. The length of the CTC’s output label sequence is applied to the attention-based decoder prediction to make the final label sequence. This process helps to decrease irregular alignments and speed up the label sequence estimation during training and inference, instead of only relying on a data-driven attention-based encoder-decoder to estimate the label sequence in long sentences. We may directly learn the proposed model from a sequence of words without detailed annotations. We evaluate the proposed system using a real collected Vietnamese CID card dataset and find that our method provides a 4.28% in WER and outperforms the common techniques.

Highlights

In computer vision, scene text recognition (STR) related to analyzing and understanding high-level semantic information from texts in the image is challenging
3.3.3 Experimental Results on Text Recognition We first explored the effects of the Convolutional Recurrent Neural Network (CRNN) based attention mechanism on manually cropped text line images from the citizen identity (CID) card images
Tab. 4 shows the performance comparison in word error rate (WER) of the CRNN model based on the Connectionist Temporal Classification (CTC) and attention

Summary

Introduction

Scene text recognition (STR) related to analyzing and understanding high-level semantic information from texts in the image is challenging. The STR systems have been extensively and successfully utilized in various applications, such as image retrieval, driver-assisted systems, recognition of personal cards and related documents, etc. The STR systems often include two main procedures: Text detection and recognition. Text detection aims to determine the location of texts from the image. Text recognition is applied to identify the texts and generate a sequence of texts from the text images. Text recognition has been successfully used to extract information from the identity card (ID card) for different languages such as English, Chinese, Vietnamese, Spanish, etc. This study focuses on developing an approach to recognize Vietnamese text and an application to extract information from Vietnamese citizen

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An improved CRNN for Vietnamese Identity Card Information Recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering

Lead the way for us

Similar Papers

Convolutional recurrent neural network with attention for Vietnamese speech to text problem in the operating room
Pham The Bao ... Trinh Tan Dat
International Journal of Intelligent Information and Database Systems | VOL. 14
Pham The Bao, et. al.Pham The Bao ... Trinh Tan Dat
01 Jan 2020
International Journal of Intelligent Information and Database Systems | VOL. 14

Channel attention convolutional recurrent neural network on street view symbol recognition
Jinke Li ... Chunyue Wang
Highlights in Science, Engineering and Technology | VOL. 9
Jinke Li, et. al.Jinke Li ... Chunyue Wang
30 Sep 2022
Highlights in Science, Engineering and Technology | VOL. 9

SCUT-EPT: New Dataset and Benchmark for Offline Chinese Text Recognition in Examination Paper
Yuanzhi Zhu ... Xiaoxue Chen
IEEE Access | VOL. 7
Yuanzhi Zhu, et. al.Yuanzhi Zhu ... Xiaoxue Chen
01 Jan 2019
IEEE Access | VOL. 7

Baidu Meizu Deep Learning Competition: Arithmetic Operation Recognition Using End-to-End Learning OCR Technologies
Yuxiang Jiang ... Abdulmotaleb El Saddik
IEEE Access | VOL. 6
Yuxiang Jiang, et. al.Yuxiang Jiang ... Abdulmotaleb El Saddik
01 Jan 2018
IEEE Access | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An improved CRNN for Vietnamese Identity Card Information Recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer Systems Science and Engineering