Understanding How Pretraining Regularizes Deep Learning Algorithms.

Yu Yao,Baosheng Yu,Chen Gong,Tongliang Liu

doi:10.1109/tnnls.2021.3131377

Abstract

Deep learning algorithms have led to a series of breakthroughs in computer vision, acoustical signal processing, and others. However, they have only been popularized recently due to the groundbreaking techniques developed for training deep architectures. Understanding the training techniques is important if we want to further improve them. Through extensive experimentation, Erhan et al. (2010) empirically illustrated that unsupervised pretraining has an effect of regularization for deep learning algorithms. However, theoretical justifications for the observation remain elusive. In this article, we provide theoretical supports by analyzing how unsupervised pretraining regularizes deep learning algorithms. Specifically, we interpret deep learning algorithms as the traditional Tikhonov-regularized batch learning algorithms that simultaneously learn predictors in the input feature spaces and the parameters of the neural networks to produce the Tikhonov matrices. We prove that unsupervised pretraining helps in learning meaningful Tikhonov matrices, which will make the deep learning algorithms uniformly stable and the learned predictor will generalize fast w.r.t. the sample size. Unsupervised pretraining, therefore, can be interpreted as to have the function of regularization.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Sep 1, 2023
Citations: 4	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Understanding How Pretraining Regularizes Deep Learning Algorithms.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Similar Papers

Feature mining for encrypted malicious traffic detection with deep learning and other machine learning algorithms
Zihao Wang ... Vrizlynn L.L Thing
Computers & Security | VOL. 128
Zihao Wang, et. al.Zihao Wang ... Vrizlynn L.L Thing
17 Feb 2023
Computers & Security | VOL. 128

A deep learning algorithm for multi-source data fusion to predict water quality of urban sewer networks
Yiqi Jiang ... Wenhui Wang
Journal of Cleaner Production | VOL. 318
Yiqi Jiang, et. al.Yiqi Jiang ... Wenhui Wang
03 Aug 2021
Journal of Cleaner Production | VOL. 318

Prediction performance advantages of deep machine learning algorithms for two-phase flow rates through wellhead chokes
Hossein Shojaei Barjouei ... Hossein Saberi
Journal of Petroleum Exploration and Production | VOL. 11
Hossein Shojaei Barjouei, et. al.Hossein Shojaei Barjouei ... Hossein Saberi
23 Feb 2021
Journal of Petroleum Exploration and Production | VOL. 11

Use of a Commercially Available Deep Learning Algorithm to Measure the Solid Portions of Lung Cancer Manifesting as Subsolid Lesions at CT: Comparisons with Radiologists and Invasive Component Size at Pathologic Examination.
Yura Ahn ... Han Na Noh
Radiology | VOL. 299
Yura Ahn, et. al.Yura Ahn ... Han Na Noh
02 Feb 2021
Radiology | VOL. 299

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Understanding How Pretraining Regularizes Deep Learning Algorithms.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems