Abstract

While research attention in deep learning has focused on pushing empirical results ever higher, remarkable progress has been made in machine learning applications in recent years. Yet deep learning based on artificial neural networks remains difficult to understand, as it is widely treated as a black-box approach. This lack of theoretical understanding not only hinders the deployment of deep networks in applications where high-stakes decisions must be made, but also limits their future development, where artificial intelligence is expected to be robust, predictable, and trustworthy. This paper aims to provide a theoretical methodology for investigating and training deep convolutional neural networks so as to ensure convergence. A mathematical model of convolutional neural networks based on matrix representations is first formulated, and an analytic layer-wise learning framework for convolutional neural networks is then proposed and tested on several common benchmark image datasets. The case studies show a reasonable trade-off between accuracy and analytic learning, and also highlight the potential of the proposed layer-wise learning method for finding the appropriate number of layers in actual implementations.
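The paper's exact matrix formulation is not reproduced in this summary. As an illustration of the general idea only, a convolution can be rewritten as a single matrix product via the standard im2col unfolding; the function name, stride, and shapes below are illustrative assumptions, not the paper's notation:

```python
import numpy as np

def im2col(x, k):
    """Unfold every k-by-k patch of a 2-D input into a column (stride 1, no padding)."""
    h, w = x.shape
    cols = np.empty((k * k, (h - k + 1) * (w - k + 1)))
    idx = 0
    for i in range(h - k + 1):
        for j in range(w - k + 1):
            cols[:, idx] = x[i:i + k, j:j + k].ravel()
            idx += 1
    return cols

# With the input unfolded, 2-D convolution (correlation) is one matrix product.
x = np.arange(16.0).reshape(4, 4)     # toy 4x4 input
w = np.ones((3, 3))                   # toy 3x3 kernel
y = (w.ravel() @ im2col(x, 3)).reshape(2, 2)

# Reference: direct sliding-window computation agrees with the matrix form.
ref = np.array([[(x[i:i + 3, j:j + 3] * w).sum() for j in range(2)]
                for i in range(2)])
assert np.allclose(y, ref)
```

Representations of this kind are what allow convolutional layers to be analyzed with the tools of linear algebra, which is the starting point of matrix-based treatments of CNNs.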

Highlights

  • Convolutional neural networks (CNNs) have been successfully utilized for various applications with image inputs, such as image classification, pattern recognition, object detection, and image segmentation

  • Although CNN models trained by backpropagation (BP) have achieved great success, the majority of these achievements are empirical rather than theoretical

  • This demonstrates the possibility of using the layer-wise learning method as an indicator for determining the appropriate number of layers in final model implementations


Summary

INTRODUCTION

Convolutional neural networks (CNNs) have been successfully utilized for various applications with image inputs, such as image classification, pattern recognition, object detection, and image segmentation. Existing analytic learning frameworks are limited to fully connected networks (FCNs) and cannot be used for full CNNs, whose structure differs from that of FCNs. In [22], [23], overparameterized networks were analyzed for deep learning by assuming a very large width in each inner layer. Although there is a trade-off in test accuracy in some case studies, the results show that some deep CNNs may not need as many convolutional layers as their original architectures use to achieve reasonable accuracy. This demonstrates the possibility of using the layer-wise learning method as an indicator for determining the appropriate number of layers in final model implementations.
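The analytic layer-wise framework itself is not detailed in this excerpt. The sketch below only illustrates the general flavor of analytic (non-backpropagation) learning: fix a hidden layer, then solve its readout weights in closed form by least squares. The layer sizes, the random-feature hidden layer, and all names are assumptions made for illustration, not the paper's method:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 100 samples, 20 features, 3 classes encoded as one-hot targets.
X = rng.standard_normal((100, 20))
T = np.eye(3)[rng.integers(0, 3, 100)]

def analytic_layer(X, T, width):
    """Fix a randomly initialized hidden layer, then compute the readout
    weights analytically via the least-squares pseudo-inverse, with no
    gradient descent or backpropagation involved."""
    W_in = rng.standard_normal((X.shape[1], width))
    H = np.tanh(X @ W_in)            # fixed hidden-layer activations
    W_out = np.linalg.pinv(H) @ T    # closed-form least-squares solution
    return W_in, W_out, H @ W_out

W_in, W_out, Y = analytic_layer(X, T, width=64)
train_acc = (Y.argmax(1) == T.argmax(1)).mean()
```

Training layers one at a time with closed-form solutions of this kind is what makes convergence analyzable, in contrast to end-to-end BP training; each solved layer's output can then serve as the input for the next.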

PROBLEM FORMULATION
EQUATION FORMULATION OF DEEP CONVOLUTIONAL NEURAL NETWORKS
CASE STUDIES
Findings
CONCLUSION
