Gradient rectified parameter unit of the fully connected layer in convolutional neural networks

Tianyou Zheng,Qiang Wang,Yue Shen,Xiaotian Lin

doi:10.1016/j.knosys.2022.108797

Abstract

Existing visualization approaches of the convolutional neural network (CNN) present the importance of the positive gradient in backpropagation, and remove the irrelevant negative gradients for improvement. However, the training procedure in CNN pays the same attention to positive and negative gradients’ optimizations. In this work, we present a gradient rectified parameter unit of the fully connected layer (GRU-FC) approach, which rectifies the corresponding parameters generating the negative gradient in the fully connected layer by zero clearing and retrains the networks with the rectified parameters. Besides, a simplified version of GRU-FC is provided to accelerate the training of the network that has a single fully connected layer for classification. Theoretical analysis of the rationalization of GRU-FC presents that GRU-FC-2L is an appropriate approach for networks with more than one fully connected layer. Experiments on the convergence analysis of the network by GRU-FC-2L is conducted. The GRU-FC approach is verified on several datasets (i.e., SV HN, STL10, CIFAR10 and ImageNet) with the recognition accuracy increased effectively. Furthermore, the GRU-FC approach shows a way to dropout unimportant weights regularly instead of randomness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Gradient rectified parameter unit of the fully connected layer in convolutional neural networks

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems

Lead the way for us

Journal: Knowledge-Based Systems	Publication Date: Apr 18, 2022
Citations: 11

Similar Papers

Heart rate, net transport cost and stride characteristics of horses exercising at walk and trot on positive and negative gradients
R J Williams ... G R Colborne
Comparative Exercise Physiology | VOL. 6
R J Williams, et. al.R J Williams ... G R Colborne
01 Aug 2009
Comparative Exercise Physiology | VOL. 6

The Hot Interstellar Medium in Normal Elliptical Galaxies. III. The Thermal Structure of the Gas
Steven Diehl ... Thomas S Statler
The Astrophysical Journal | VOL. 687
Steven Diehl, et. al.Steven Diehl ... Thomas S Statler
11 Jun 2008
The Astrophysical Journal | VOL. 687

The local Universe in the era of large surveys – III. Radial activity profiles of S0 galaxies
J L Tous ... J D Perea
Monthly Notices of the Royal Astronomical Society | VOL. 528
J L Tous, et. al.J L Tous ... J D Perea
10 Jan 2024
Monthly Notices of the Royal Astronomical Society | VOL. 528

Evaluation of linear gradient loaded columns in temperature programmed gas chromatography
A.B Christophe
Journal of Chromatography A | VOL. 58
A.B ChristopheA.B Christophe
01 Jan 1970
Journal of Chromatography A | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Gradient rectified parameter unit of the fully connected layer in convolutional neural networks

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems