Abstract

Recent years have witnessed the success of convolutional neural networks (CNNs) in many machine learning and pattern recognition applications, especially image recognition. However, as model complexity increases, parameter redundancy arises and greatly degrades the performance of CNNs. To alleviate this problem, various regularization techniques, such as Dropout, have been proposed and proven effective. In this paper, we propose a novel adaptive kernel-based weight decorrelation (AKWD) framework to regularize CNNs for better generalization. Unlike existing works, the correlation between pairs of weights is measured by the cosine distance defined in the reproducing kernel Hilbert space (RKHS) associated with a specific kernel. The case of the well-known Gaussian kernel is investigated in detail, where the bandwidth parameter is estimated adaptively. By regularizing CNN models of different capacities with AKWD, better performance is achieved on several benchmark databases for both object classification and face verification tasks. In particular, when Dropout or BatchNorm is present, even greater improvements are obtained with AKWD, demonstrating that the proposed regularizer is compatible with other regularization techniques.
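To make the idea concrete, below is a minimal PyTorch sketch of such a kernelized decorrelation penalty. It is an illustration, not the paper's implementation: the function name `akwd_penalty` is ours, and the adaptive bandwidth is filled in with the common median heuristic, since the abstract does not specify the paper's estimator. Note that for a Gaussian kernel, k(w, w) = 1, so the RKHS cosine similarity between two flattened filters reduces to the kernel value itself.

```python
import torch

def akwd_penalty(weight: torch.Tensor, eps: float = 1e-12) -> torch.Tensor:
    """Sketch of a kernelized weight-decorrelation penalty (illustrative only).

    `weight` is a layer's weight tensor with shape (n_filters, ...);
    each filter is flattened into a row vector.
    """
    w = weight.flatten(1)                      # (n_filters, d)
    sq_dist = torch.cdist(w, w, p=2) ** 2      # pairwise squared distances

    # Adaptive bandwidth via the median heuristic -- an assumption here;
    # the paper's own estimator is not given in the abstract.
    n = w.shape[0]
    off_diag = sq_dist[~torch.eye(n, dtype=torch.bool, device=w.device)]
    sigma2 = off_diag.median().clamp_min(eps)

    # Gaussian kernel: k(w_i, w_j) = exp(-||w_i - w_j||^2 / (2 * sigma^2)).
    # Since k(w, w) = 1, this value is also the RKHS cosine similarity.
    sim = torch.exp(-sq_dist / (2 * sigma2))

    # Penalize similarity over distinct filter pairs (upper triangle only).
    iu = torch.triu_indices(n, n, offset=1)
    return sim[iu[0], iu[1]].mean()
```

In training, the returned penalty would typically be scaled by a hyperparameter and added to the task loss, e.g. `loss = ce_loss + lam * akwd_penalty(conv.weight)`, so that filters are pushed apart in the kernel-induced feature space.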
