End-to-end subtitle detection and recognition for videos in East Asian languages via CNN ensemble

Yan Xu,Siyuan Shan,Ziming Qiu,Zhipeng Jia,Zhengyang Shen,Yipei Wang,Mengfei Shi,Eric I-Chao Chang

doi:10.1016/j.image.2017.09.013

Abstract

In this paper, we propose an innovative end-to-end subtitle detection and recognition system for videos in East Asian languages. Our end-to-end system consists of multiple stages. Subtitles are firstly detected by a novel image operator based on the sequence information of consecutive video frames. Then, an ensemble of Convolutional Neural Networks (CNNs) trained on synthetic data is adopted for detecting and recognizing East Asian characters. Finally, a dynamic programming approach leveraging language models is applied to constitute results of the entire body of text lines. The proposed system achieves average end-to-end accuracies of 98.2% and 98.3% on 40 videos in Simplified Chinese and 40 videos in Traditional Chinese respectively, which is a significant outperformance of other existing methods. The near-perfect accuracy of our system dramatically narrows the gap between human cognitive ability and state-of-the-art algorithms used for such a task.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

End-to-end subtitle detection and recognition for videos in East Asian languages via CNN ensemble

Abstract

Talk to us

Similar Papers

More From: Signal Processing: Image Communication

Lead the way for us

Journal: Signal Processing: Image Communication	Publication Date: Oct 16, 2017
Citations: 37

Similar Papers

Ensemble of Deep Convolutional Neural Networks for Automatic Pavement Crack Detection and Measurement
Zhun Fan ... Giuseppe Loprencipe
Coatings | VOL. 10
Zhun Fan, et. al.Zhun Fan ... Giuseppe Loprencipe
08 Feb 2020
Coatings | VOL. 10

Ensemble of PANORAMA-based convolutional neural networks for 3D model classification and retrieval
Konstantinos Sfikas ... Theoharis Theoharis
Computers & Graphics | VOL. 71
Konstantinos Sfikas, et. al.Konstantinos Sfikas ... Theoharis Theoharis
13 Dec 2017
Computers & Graphics | VOL. 71

An Ensemble of Fine-Tuned Convolutional Neural Networks for Medical Image Classification.
Ashnil Kumar ... David Lyndon
IEEE Journal of Biomedical and Health Informatics | VOL. 21
Ashnil Kumar, et. al.Ashnil Kumar ... David Lyndon
05 Dec 2016
IEEE Journal of Biomedical and Health Informatics | VOL. 21

Local minima found in the subparameter space can be effective for ensembles of deep convolutional neural networks
Yongquan Yang ... Zhongxi Zheng
Pattern Recognition | VOL. 109
Yongquan Yang, et. al.Yongquan Yang ... Zhongxi Zheng
08 Aug 2020
Pattern Recognition | VOL. 109

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

End-to-end subtitle detection and recognition for videos in East Asian languages via CNN ensemble

Abstract

Talk to us

Similar Papers

More From: Signal Processing: Image Communication