Abstract
Understanding the effect of depth in deep learning is a critical problem. In this work, we use Fourier analysis to empirically provide a promising mechanism for understanding why deeper feedforward networks learn faster. To this end, during analysis we separate a deep neural network, trained by standard stochastic gradient descent, into two parts: a pre-condition component and a learning component, where the output of the pre-condition component is the input of the learning component. We use a filtering method to characterize the frequency distribution of a high-dimensional function. Based on experiments with deep networks and real datasets, we propose a deep frequency principle: the effective target function for a deeper hidden layer biases towards lower frequencies during training. Therefore, the learning component effectively learns a lower-frequency function when the pre-condition component has more layers. Combined with the well-studied frequency principle, i.e., that deep neural networks learn lower-frequency functions faster, the deep frequency principle provides a reasonable explanation of why deeper learning is faster. We believe these empirical studies will be valuable for future theoretical studies of the effect of depth in deep learning.
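To make the filtering idea concrete, the snippet below is a minimal sketch (not the paper's exact procedure): it low-pass filters a target function known only on sample points by convolving with a Gaussian kernel of an assumed width `delta`, and reports the fraction of the function's energy captured by the low-frequency part. Such a ratio is one way to quantify how "low-frequency" an effective target function is.

```python
# Hypothetical illustration of a filtering-based frequency characterization;
# the kernel width `delta` and the ratio definition are assumptions for this sketch.
import numpy as np

def low_frequency_ratio(X, y, delta=1.0):
    """X: (n, d) input samples; y: (n,) target values; delta: filter width.

    Convolving with a Gaussian of width delta in input space suppresses
    frequency components above roughly 1/delta, so y_low approximates the
    low-frequency part of the target and y - y_low the high-frequency part.
    """
    # Pairwise squared distances between sample points.
    d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    # Gaussian low-pass kernel, normalized row-wise so constants pass through unchanged.
    K = np.exp(-d2 / (2.0 * delta ** 2))
    K /= K.sum(axis=1, keepdims=True)
    y_low = K @ y                # low-frequency component on the samples
    y_high = y - y_low           # remaining high-frequency component
    return np.sum(y_low ** 2) / (np.sum(y_low ** 2) + np.sum(y_high ** 2))

# Toy usage: a smooth target yields a ratio near 1; an oscillatory one, lower.
X = np.random.rand(500, 2)
print(low_frequency_ratio(X, np.sin(2 * np.pi * X[:, 0]), delta=0.2))
print(low_frequency_ratio(X, np.sin(20 * np.pi * X[:, 0]), delta=0.2))
```

Under the deep frequency principle, applying such a measure to the effective target of each hidden layer would show the ratio increasing with depth over training.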