Abstract

Object tracking based on deep learning is a hot topic in computer vision with many applications. Due to high computation and memory costs, it is difficult to deploy convolutional neural networks (CNNs) for object tracking on embedded systems with limited hardware resources. In this paper, a Siamese network forms the backbone of our tracker. The convolution layers used to extract features typically incur the highest costs, so improvements should focus on them to make tracking more efficient. We optimize the standard convolution with separable convolution, which consists mainly of a depthwise convolution followed by a pointwise convolution. To further reduce computation, filters in the depthwise convolution layers are pruned according to their variance. Because weight distributions differ across convolution layers, the filter pruning is guided by a designed hyper-parameter. With these improvements, the number of parameters is reduced to 13% of the original network and the computation to 23%. On the NVIDIA Jetson TX2, tracking speed increases by a factor of 3.65 on the CPU and 2.08 on the GPU, without significant degradation of tracking performance on the VOT benchmark.
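The separable factorization replaces each k × k standard convolution with a depthwise convolution (one k × k filter per input channel) followed by a 1 × 1 pointwise convolution. A minimal sketch of the resulting parameter savings follows; the layer shape below is illustrative and not taken from the paper:

```python
# Sketch (not the paper's code): parameter counts for a standard
# convolution versus its depthwise-separable factorization.
def standard_conv_params(k, c_in, c_out):
    # each of the c_out filters spans all c_in input channels
    return k * k * c_in * c_out

def separable_conv_params(k, c_in, c_out):
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 filters that mix channels
    return depthwise + pointwise

# Hypothetical layer shape, chosen only for illustration:
k, c_in, c_out = 3, 128, 256
ratio = separable_conv_params(k, c_in, c_out) / standard_conv_params(k, c_in, c_out)
print(f"separable/standard parameter ratio: {ratio:.3f}")
```

For a 3 × 3 layer the factorization cuts parameters by roughly an order of magnitude, which is consistent with the overall 13% parameter figure reported above once filter pruning is applied as well.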

Highlights

  • Visual object tracking tasks predict the object region in the subsequent frames when its size and position are given in the first video frame

  • With the application of deep learning in object tracking, the millions of parameters and huge computation of convolutional neural networks (CNNs) pose a challenge for deployment on resource-limited hardware

  • The standard CNN is improved by separable convolution and filter pruning


Summary

INTRODUCTION

Visual object tracking tasks predict the object region in subsequent frames when its size and position are given in the first video frame. Convolution layers in deep neural networks extract the features of the object region and of each video frame, and this process incurs most of the parameters and calculations in tracking networks. In trained models, filters in the depthwise convolution layers with low variance are assumed to contribute little to feature extraction, so they can be pruned to further reduce network size and computation. In the subsequent 1 × 1 pointwise convolution, the number of channels in each pointwise filter shrinks accordingly, because the pruned depthwise filters no longer produce input feature maps.
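The pruning step above can be sketched as follows. This is our illustration, not the paper's code: the exact pruning criterion and the form of the layer-wise hyper-parameter (here a scale factor `alpha` on the mean variance) are assumptions for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

c_in, k = 8, 3
depthwise = rng.normal(size=(c_in, k, k))      # one k x k filter per input channel
pointwise = rng.normal(size=(16, c_in, 1, 1))  # 16 pointwise filters, c_in channels each

# Per-filter variance; low-variance depthwise filters are assumed to
# contribute little to feature extraction.
variances = depthwise.reshape(c_in, -1).var(axis=1)

# A layer-wise hyper-parameter scales the threshold to the layer's own
# weight distribution (assumed form, for illustration only).
alpha = 0.8
keep = variances >= alpha * variances.mean()

depthwise_pruned = depthwise[keep]
# The pointwise filters lose the matching input channels, since the
# pruned depthwise filters no longer produce those feature maps.
pointwise_pruned = pointwise[:, keep]

print(depthwise_pruned.shape, pointwise_pruned.shape)
```

Note that pruning one depthwise filter removes a channel from every pointwise filter, so both convolutions shrink together.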

