Convolutional Neural Network Filter Research Articles

In recent years, stance detection has become an important topic in natural language processing. Earlier work relied on feature engineering, which requires researchers to define and extract features suited to each particular application; this leads to poor generalization and a complex modeling process. Other researchers have applied deep learning methods. However, the popular convolutional neural network (CNN) approach suffers from information loss, and a CNN filter of a single size cannot accurately extract text features of different lengths, so it cannot handle the diversity of features. To address these problems, we propose a two-channel CNN-GRU fusion network. First, a convolution layer with two filters of different window sizes extracts local features from the topic content and the text content. Then, a gated recurrent unit (GRU) network extracts their temporal characteristics. Finally, the intermediate features are concatenated and fed to a classifier to complete the stance detection. Our method is validated on data from NLPCC 2016. The experimental results show that the accuracy (ACC) and average F1 score of this method are 13.1% and 15.6% better than the SVM method, 6.2% and 11.6% better than the CNN method, 5.6% and 3.3% better than the GRU method, and 1.1% and 2.2% better than the hybrid model proposed by Nanyu, respectively; the last of these serves as a baseline, and the improvement over it comes with no increase in run-time. Our method also achieves the same accuracy with less run-time than another baseline, the semantic attention-based model proposed by Zhou. In addition, our method classifies better than the single-channel model. Finally, we find that the run-time of a multi-channel CNN-GRU increases gradually with the number of channels while the classification accuracy does not improve, so a two-channel CNN-GRU is the most appropriate choice.
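The pipeline described in this abstract (convolution with two window sizes, a GRU over the resulting feature sequence, then concatenation and classification) can be sketched in NumPy as below. This is only an illustrative forward pass with random weights, not the authors' implementation. The wiring (one GRU per window size, separate weights for the topic and text channels, no bias terms), all dimensions, the window sizes (2, 3), the three output classes, and every function name are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes chosen only for illustration.
EMB, FILTERS, HIDDEN, CLASSES = 8, 4, 5, 3
WINDOWS = (2, 3)  # two filter window sizes (assumed values)

def conv1d_relu(x, kernel):
    """Valid 1-D convolution over time: x (T, d), kernel (w, d, f) -> (T - w + 1, f)."""
    w = kernel.shape[0]
    windows = np.stack([x[t:t + w] for t in range(x.shape[0] - w + 1)])
    return np.maximum(np.einsum("twd,wdf->tf", windows, kernel), 0.0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_last_state(seq, params):
    """Run a bias-free GRU over seq (T, f) and return the final hidden state."""
    Wz, Uz, Wr, Ur, Wh, Uh = params
    h = np.zeros(Uz.shape[0])
    for x_t in seq:
        z = sigmoid(x_t @ Wz + h @ Uz)              # update gate
        r = sigmoid(x_t @ Wr + h @ Ur)              # reset gate
        h_tilde = np.tanh(x_t @ Wh + (r * h) @ Uh)  # candidate state
        h = (1.0 - z) * h + z * h_tilde
    return h

def init_channel():
    """Random weights for one channel: a filter bank and a GRU per window size."""
    kernels = [rng.normal(scale=0.1, size=(w, EMB, FILTERS)) for w in WINDOWS]
    gru_shapes = [(FILTERS, HIDDEN), (HIDDEN, HIDDEN)] * 3
    grus = [[rng.normal(scale=0.1, size=s) for s in gru_shapes] for _ in WINDOWS]
    return kernels, grus

def encode_channel(x, kernels, grus):
    """Convolve with each window size, summarize each result with its GRU, concatenate."""
    return np.concatenate(
        [gru_last_state(conv1d_relu(x, k), g) for k, g in zip(kernels, grus)]
    )

topic_params, text_params = init_channel(), init_channel()
W_out = rng.normal(scale=0.1, size=(2 * len(WINDOWS) * HIDDEN, CLASSES))

def predict(topic_emb, text_emb):
    """Fuse the topic and text channels and return softmax class probabilities."""
    fused = np.concatenate([
        encode_channel(topic_emb, *topic_params),
        encode_channel(text_emb, *text_params),
    ])
    logits = fused @ W_out
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()

# Random stand-ins for embedded topic (4 tokens) and microblog text (12 tokens).
probs = predict(rng.normal(size=(4, EMB)), rng.normal(size=(12, EMB)))
```

Because each window size produces a feature sequence of a different length, summarizing each with a GRU's final state gives fixed-size vectors that can be concatenated regardless of input length.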

Rank pooling is a temporal encoding method that summarizes the dynamics of a video sequence in a single vector, and it has shown good results in human action recognition in prior work. In this work, we present novel temporal encoding methods for action and activity classification by extending the unsupervised rank pooling method in two ways. First, we present discriminative rank pooling, in which the shared weights of our video representation and the parameters of the action classifiers are estimated jointly for a given training dataset of labelled vector sequences, using a bilevel optimization formulation of the learning problem. When the frame-level feature vectors are obtained from a convolutional neural network (CNN), we rank pool the network activations and jointly estimate all parameters of the model, including CNN filters and fully-connected weights, in an end-to-end manner; we call this model the end-to-end trainable rank pooled CNN. Importantly, this model can make use of any existing CNN architecture (e.g., AlexNet or VGG) without modification or the introduction of additional parameters. Second, we extend rank pooling to a high-capacity video representation called hierarchical rank pooling. Hierarchical rank pooling consists of a network of rank pooling functions that encode temporal semantics over arbitrarily long video clips based on rich frame-level features. By stacking non-linear feature functions and temporal sub-sequence encoders one on top of the other, we build a high-capacity encoding network of the dynamic behaviour of the video. The resulting video representation is a fixed-length feature vector describing the entire video clip that can be used as input to standard machine learning classifiers. We demonstrate our approach on the task of action and activity recognition. We present a detailed analysis of our approach against competing methods and explore variants such as hierarchy depth and choice of non-linear feature function. The results are comparable to state-of-the-art methods on three important activity recognition benchmarks, with classification performance of 76.7% mAP on Hollywood2, 69.4% on HMDB51, and 93.6% on UCF101.
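As a rough illustration of the unsupervised building block, the sketch below encodes a sequence of frame features as the weight vector of a linear function whose scores tend to increase with time, then stacks this encoder over overlapping sub-sequences. It covers only the hierarchical, unsupervised idea; the discriminative and end-to-end variants are not shown. The ridge-regression fit is a simplifying stand-in for a ranking objective, and the ReLU non-linearity, window, stride, depth, and all function names are assumptions made for the example.

```python
import numpy as np

def rank_pool(frames, ridge=1e-3):
    """Encode frames (T, d) as a vector u (d,) whose scores v_t @ u tend to grow with t."""
    T, d = frames.shape
    t = np.arange(1, T + 1, dtype=float)
    # Time-varying mean vectors smooth the frame features before fitting.
    v = np.cumsum(frames, axis=0) / t[:, None]
    # Assumption: ridge regression of the time index on v_t replaces a ranking solver.
    return np.linalg.solve(v.T @ v + ridge * np.eye(d), v.T @ t)

def hierarchical_rank_pool(frames, window=4, stride=2, depth=2):
    """Stack sub-sequence encoders and a non-linearity, then pool the final sequence."""
    seq = frames
    for _ in range(depth - 1):
        starts = range(0, max(len(seq) - window, 0) + 1, stride)
        # Encode each overlapping sub-sequence, then apply ReLU (an assumed choice).
        seq = np.stack([np.maximum(rank_pool(seq[s:s + window]), 0.0) for s in starts])
    return rank_pool(seq)

# Random stand-in for per-frame features of a 30-frame clip with a slow drift.
rng = np.random.default_rng(0)
video = rng.normal(size=(30, 5)) + np.linspace(0.0, 1.0, 30)[:, None]
code = hierarchical_rank_pool(video)
```

The output has the same length as one frame feature vector regardless of how many frames the clip contains, which is what lets it feed a standard classifier.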

Articles published on Convolutional Neural Network Filter

Stance Detection of Microblog Text Based on Two-Channel CNN-GRU Fusion Network

An Always-On 3.8 μJ/86% CIFAR-10 Mixed-Signal Binary CNN Processor With All Memory on Chip in 28-nm CMOS

A deep CNN based transfer learning method for false positive reduction

Convolutional adaptive denoising autoencoders for hierarchical feature extraction

Ambience Inhaling: Speech Noise Inhaler in Mobile Robots using Deep Learning

Discriminatively Learned Hierarchical Rank Pooling Networks

A-optimal convolutional neural network

Building Correlations Between Filters in Convolutional Neural Networks

Convolutional deep learning for 3D object retrieval
