In this paper, we propose a deep reinforcement learning based approach to the learning-to-rank task. Reinforcement learning has been applied to ranking with some success, but existing policy gradient approaches suffer from noisy gradients and high variance, resulting in unstable learning. Classic policy gradient methods such as REINFORCE estimate the gradient via Monte Carlo sampling, drawing trajectories at random, which leads to high-variance estimates. Moreover, as the action space grows large, i.e., with a very large number of candidate documents, traditional RL techniques lack the model capacity required to deal with so many items. Our approach addresses both issues. By combining deep learning with the reinforcement learning framework, it can learn a complex ranking function, since deep neural networks are powerful function approximators. We adopt an actor-critic framework in which the critic reduces variance through techniques such as delayed policy updates and clipped double Q-learning. Furthermore, because of the enormous scale of the web, the most relevant results must be returned for each query from within a very large action space; policy gradient algorithms with deep neural networks have been applied effectively to such large action spaces (items), as they do not require estimating a value for every action (item), unlike value-based methods. In the ranking process, our actor network uses a CNN layer to capture sequential patterns among the documents. We train the agent with the TD3 algorithm and a listwise loss function; TD3's delayed policy updates yield value estimates with lower variance. To the best of our knowledge, this is the first deep reinforcement learning method applied to learning to rank for document retrieval. We performed experiments on several LETOR datasets and show that our method outperforms various state-of-the-art baselines.
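To make the CNN-based actor concrete, the following is a minimal sketch of how a convolutional layer can score a candidate list so that each document's score depends on its neighbors. All names, dimensions, and hyperparameters (`CNNActor`, `feature_dim=46`, `kernel_size=3`, etc.) are illustrative assumptions, not the paper's exact architecture.

```python
# Hypothetical sketch of a CNN-based actor that scores a candidate document list.
# Names and sizes are assumptions, not the paper's actual implementation.
import torch
import torch.nn as nn

class CNNActor(nn.Module):
    def __init__(self, feature_dim=46, hidden=64, kernel_size=3):
        super().__init__()
        # Conv1d slides over the document axis, so each document's score can
        # depend on its neighbors, capturing sequential patterns in the list.
        self.conv = nn.Conv1d(feature_dim, hidden, kernel_size,
                              padding=kernel_size // 2)
        self.score = nn.Linear(hidden, 1)

    def forward(self, docs):
        # docs: (batch, n_docs, feature_dim) LETOR-style feature vectors.
        h = torch.relu(self.conv(docs.transpose(1, 2)))   # (batch, hidden, n_docs)
        return self.score(h.transpose(1, 2)).squeeze(-1)  # (batch, n_docs) scores

actor = CNNActor()
docs = torch.randn(2, 10, 46)  # two queries, ten candidate documents each
ranking = actor(docs).argsort(dim=-1, descending=True)  # permutation = ranked list
```

Sorting the per-document scores yields the output ranking, which is what a listwise loss would then evaluate against the relevance labels.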
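The sketch below likewise illustrates the critic-side variance-reduction machinery in a generic TD3-style update: twin critics trained against the minimum of their target estimates (clipped double Q-learning), target policy smoothing, and an actor refreshed only every few critic steps (delayed policy updates). This is a standard TD3 skeleton under assumed names and hyperparameters, not the paper's implementation.

```python
# Minimal TD3-style update sketch (PyTorch). Class and variable names are
# hypothetical; the paper's architecture and hyperparameters may differ.
import copy
import torch
import torch.nn as nn

class MLP(nn.Module):
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 256), nn.ReLU(),
                                 nn.Linear(256, out_dim))
    def forward(self, x):
        return self.net(x)

state_dim, action_dim = 46, 46            # e.g., LETOR feature dimensionality
actor = MLP(state_dim, action_dim)
critic1 = MLP(state_dim + action_dim, 1)  # twin critics for clipped double Q
critic2 = MLP(state_dim + action_dim, 1)
actor_t, critic1_t, critic2_t = map(copy.deepcopy, (actor, critic1, critic2))

actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-4)
critic_opt = torch.optim.Adam(list(critic1.parameters()) +
                              list(critic2.parameters()), lr=1e-3)
gamma, tau, policy_delay, noise_std, noise_clip = 0.99, 0.005, 2, 0.2, 0.5

def td3_update(step, state, action, reward, next_state, done):
    with torch.no_grad():
        # Target policy smoothing: perturb the target action with clipped noise.
        noise = (torch.randn_like(action) * noise_std).clamp(-noise_clip, noise_clip)
        next_action = actor_t(next_state) + noise
        # Clipped double Q-learning: take the minimum of the two target critics
        # to curb overestimation bias and stabilize the value targets.
        q1 = critic1_t(torch.cat([next_state, next_action], dim=-1))
        q2 = critic2_t(torch.cat([next_state, next_action], dim=-1))
        target_q = reward + gamma * (1.0 - done) * torch.min(q1, q2)

    sa = torch.cat([state, action], dim=-1)
    critic_loss = ((critic1(sa) - target_q) ** 2).mean() + \
                  ((critic2(sa) - target_q) ** 2).mean()
    critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

    # Delayed policy updates: refresh the actor and targets only every few steps.
    if step % policy_delay == 0:
        actor_loss = -critic1(torch.cat([state, actor(state)], dim=-1)).mean()
        actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()
        for net, tgt in ((actor, actor_t), (critic1, critic1_t), (critic2, critic2_t)):
            for p, tp in zip(net.parameters(), tgt.parameters()):
                tp.data.mul_(1 - tau).add_(tau * p.data)
```

Because the actor is updated less frequently than the critics, the policy gradient is computed against value estimates that have had time to settle, which is the source of the lower-variance updates described above.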