Kernels In Convolutional Neural Networks Research Articles

Convolutional Neural Networks (CNNs) have demonstrated outstanding performance in various domains, such as face recognition, object detection, and image segmentation. However, the lack of transparency and limited interpretability inherent in CNNs pose challenges in fields such as medical diagnosis, autonomous driving, finance, and military applications. Several studies have explored the interpretability of CNNs and proposed various post-hoc interpretable methods. The majority of these methods are feature-based, focusing on the influence of input variables on outputs. Few methods undertake the analysis of parameters in CNNs and their overall structure. To explore the structure of CNNs and intuitively comprehend the role of their internal parameters, we propose an Attribution Graph-based Interpretable method for CNNs (AGIC) which models the overall structure of CNNs as graphs and provides interpretability from global and local perspectives. The runtime parameters of CNNs and feature maps of each image sample are applied to construct attribution graphs (At-GCs), where the convolutional kernels are represented as nodes and the SHAP values between kernel outputs are assigned as edges. These At-GCs are then employed to pretrain a newly designed heterogeneous graph encoder based on Deep Graph Infomax (DGI). To comprehensively delve into the overall structure of CNNs, the pretrained encoder is used for two types of interpretable tasks: (1) a classifier is attached to the pretrained encoder for the classification of At-GCs, revealing the dependency of At-GC’s topological characteristics on the image sample categories, and (2) a scoring aggregation (SA) network is constructed to assess the importance of each node in At-GCs, thus reflecting the relative importance of kernels in CNNs. The experimental results indicate that the topological characteristics of At-GC exhibit a dependency on the sample category used in its construction, which reveals that kernels in CNNs show distinct combined activation patterns for processing different image categories, meanwhile, the kernels that receive high scores from SA network are crucial for feature extraction, whereas low-scoring kernels can be pruned without affecting model performance, thereby enhancing the interpretability of CNNs.

Read full abstract

Future video prediction provides valuable information that helps a computer machine understand the surrounding environment and make critical decisions in real-time. However, long-term video prediction remains a challenging problem due to the complicated spatiotemporal dynamics in a video. In this paper, we propose a dynamic motion estimation and evolution (DMEE) network model to generate unseen future videos from the observed videos in the past. Our primary contribution is to use trained kernels in convolutional neural network (CNN) and long short-term memory (LSTM) architectures, adapted to each time step and sample position, to efficiently manage spatiotemporal dynamics. DMEE uses the motion estimation (ME) and motion update (MU) kernels to predict the future video using an end-to-end prediction-update process. In the prediction, the ME kernel estimates the temporal changes. In the update step, the MU kernel combines the estimates with the previously generated frames as reference frames using a weighted average. The kernels are not only used for a current frame, but also are evolved to generate successive frames to enable temporally specific filtering. We perform qualitative performance analysis and quantitative performance analysis based on the peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), and video classification score developed for examining the visual quality of the generated video. It is demonstrated with experiments that our algorithm provides better qualitative and quantitative performance superior to the current state-of-the-art algorithms. Our source codes are available in <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/Nayoung-Kim-ICP/Video-Generation</uri> .

Read full abstract

Kernels In Convolutional Neural Networks Research Articles

Related Topics

Articles published on Kernels In Convolutional Neural Networks

Early diagnosis of Alzheimer’s disease using a group self-calibrated coordinate attention network based on multimodal MRI

An attribution graph-based interpretable method for CNNs

Learnable Gabor Kernels in Convolutional Neural Networks for Seismic Interpretation Tasks

Unleashing the full potential of hyperspectral imaging: Decoupled image and frequency-domain spatial–spectral framework

Dead pixel test using effective receptive field

A Low-Power Hardware Architecture for Real-Time CNN Computing.

Label-based, Mini-batch Combinations Study for Convolutional Neural Network Based Fluid-film Bearing Rotor System Diagnosis

Going Deeper into OSNR Estimation with CNN

Naive Gabor Networks for Hyperspectral Image Classification.

Dynamic Motion Estimation and Evolution Video Prediction Network

Fusing HOG and convolutional neural network spatial–temporal features for video‐based facial expression recognition

Deep eigen-filters for face recognition: Feature representation via unsupervised multi-structure filter learning

Hyperspectral Image Classification Based on Spectral and Spatial Information Using Multi-Scale ResNet

A Dual Neural Architecture Combined SqueezeNet with OctConv for LiDAR Data Classification.

Blood Cell Classification Based on Hyperspectral Imaging With Modulated Gabor and CNN.

Automatic Kernel Size Determination for Deep Neural Networks Based Hyperspectral Image Classification

Epithelium-Stroma Classification via Convolutional Neural Networks and Unsupervised Domain Adaptation in Histopathological Images.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Kernels In Convolutional Neural Networks Research Articles

Related Topics

Articles published on Kernels In Convolutional Neural Networks

Early diagnosis of Alzheimer’s disease using a group self-calibrated coordinate attention network based on multimodal MRI

An attribution graph-based interpretable method for CNNs

Learnable Gabor Kernels in Convolutional Neural Networks for Seismic Interpretation Tasks

Unleashing the full potential of hyperspectral imaging: Decoupled image and frequency-domain spatial–spectral framework

Dead pixel test using effective receptive field

A Low-Power Hardware Architecture for Real-Time CNN Computing.

Label-based, Mini-batch Combinations Study for Convolutional Neural Network Based Fluid-film Bearing Rotor System Diagnosis

Going Deeper into OSNR Estimation with CNN

Naive Gabor Networks for Hyperspectral Image Classification.

Dynamic Motion Estimation and Evolution Video Prediction Network

Fusing HOG and convolutional neural network spatial–temporal features for video‐based facial expression recognition

Deep eigen-filters for face recognition: Feature representation via unsupervised multi-structure filter learning

Hyperspectral Image Classification Based on Spectral and Spatial Information Using Multi-Scale ResNet

A Dual Neural Architecture Combined SqueezeNet with OctConv for LiDAR Data Classification.

Blood Cell Classification Based on Hyperspectral Imaging With Modulated Gabor and CNN.

Automatic Kernel Size Determination for Deep Neural Networks Based Hyperspectral Image Classification

Epithelium-Stroma Classification via Convolutional Neural Networks and Unsupervised Domain Adaptation in Histopathological Images.