Compressing CNN-DBLSTM models for OCR with teacher-student learning and Tucker decomposition

Haisong Ding,Kai Chen,Qiang Huo

doi:10.1016/j.patcog.2019.07.002

Abstract

Integrated convolutional neural network (CNN) and deep bidirectional long short-term memory (DBLSTM) based character models have achieved excellent recognition accuracies on optical character recognition (OCR) tasks, along with large amount of model parameters and massive computation cost. To deploy CNN-DBLSTM model in products with CPU server, there is an urgent need to compress and accelerate it as much as possible, especially the CNN part, which dominates both parameters and computation. In this paper, we study teacher-student learning and Tucker decomposition methods to reduce model size and runtime latency for CNN-DBLSTM based character model for OCR. We use teacher-student learning to transfer the knowledge of a large-size teacher model to a small-size compact student model, followed by Tucker decomposition to further compress the student model. For teacher-student learning, we design a novel learning criterion to bring in the guidance of succeeding LSTM layer when matching the CNN-extracted feature sequences of the large teacher and small student models. Experimental results on large scale handwritten and printed OCR tasks show that, using teacher-student learning alone achieves 9.90 × footprint reduction and 15.23 × inference speedup yet without degrading recognition accuracy. Combined with Tucker decomposition method, we can compress and accelerate the model further. The decomposed model achieves 11.89 × footprint reduction and 22.16 × inference speedup while suffering no or only a small recognition accuracy degradation against the large-size baseline model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Compressing CNN-DBLSTM models for OCR with teacher-student learning and Tucker decomposition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Jul 12, 2019
Citations: 28

Similar Papers

Building Compact CNN-DBLSTM Based Character Models for Handwriting Recognition and OCR by Teacher-Student Learning
Haisong Ding ... Meng Cai
-
Haisong Ding, et. al.Haisong Ding ... Meng Cai
01 Aug 2018
01 Aug 2018

Improved parcel sorting by combining automatic speech and character recognition
Amriteshwar Singh ... John H L Hansen
-
Amriteshwar Singh, et. al.Amriteshwar Singh ... John H L Hansen
01 Jan 2012
01 Jan 2012

A Compact CNN-DBLSTM Based Character Model for Offline Handwriting Recognition with Tucker Decomposition
Haisong Ding ... Kai Chen
-
Haisong Ding, et. al.Haisong Ding ... Kai Chen
01 Nov 2017
01 Nov 2017

Building an efficient OCR system for historical documents with little training data
Jiří Martínek ... Ladislav Lenc
Neural Computing and Applications | VOL. 32
Jiří Martínek, et. al.Jiří Martínek ... Ladislav Lenc
09 May 2020
Neural Computing and Applications | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Compressing CNN-DBLSTM models for OCR with teacher-student learning and Tucker decomposition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition