Design and Investigation of Capsule Networks for Sentence Classification

Haftu Wedajo Fentaw,Tae-Hyong Kim

doi:10.3390/app9112200

Haftu Wedajo Fentaw, Tae-Hyong Kim

Open Access

https://doi.org/10.3390/app9112200

Copy DOI

Journal: Applied Sciences	Publication Date: May 29, 2019
Citations: 15	License type: CC BY 4.0

Affiliation: Kumoh National Institute of Technology

Abstract

In recent years, convolutional neural networks (CNNs) have been used as an alternative to recurrent neural networks (RNNs) in text processing with promising results. In this paper, we investigated the newly introduced capsule networks (CapsNets), which are getting a lot of attention due to their great performance gains on image analysis more than CNNs, for sentence classification or sentiment analysis in some cases. The results of our experiment show that the proposed well-tuned CapsNet model can be a good, sometimes better and cheaper, substitute of models based on CNNs and RNNs used in sentence classification. In order to investigate whether CapsNets can learn the sequential order of words or not, we performed a number of experiments by reshuffling the test data. Our CapsNet model shows an overall better classification performance and better resistance to adversarial attacks than CNN and RNN models.

Highlights

Sentence classification is a fundamental problem in natural language processing (NLP) which aims to organize a very large amount of textual information into groups so that users can access this bulk information with so much ease
We present a comparison of the accuracy of our CapsNet model with other state-of-the-art models, a visualization of the capsule outputs, and a detailed error inspection in the inference stage
The datasets used in the experiment are the following and details of these datasets are given in movie reviews (MR) Dataset: movie review dataset [23]

Summary

Introduction

Sentence classification is a fundamental problem in natural language processing (NLP) which aims to organize a very large amount of textual information into groups so that users can access this bulk information with so much ease. Previous research on sentence classification or sentiment analysis relied on using different methods ranging from non-neural network based approaches like topical categorization and knowledge based models to neural network based techniques. Replacing CNNs by CapsNet, which can adequately capture the spatial relations between features, has higher potential for a better representation and understanding of a given NLP task. In this work, which is an extension of work presented in [6], we present a CapsNet-based model designed for sentence classification, sentiment analysis in some cases, using a Word2vec model as the vector encoding technique. The results of our experiment show that our well-tuned CapsNet model outperforms CNN models and can be a good, sometimes better and cheaper, substitute of much slower RNN-based models used in sentence classification

Related Work

Proposed Model

Preprocessing

Sentence Model

Convolution Stage

Primary Caps Layer

Class Caps Layer

Experiments, Results and Discussion

Baseline Models

CapsNet Model

Effect of Kernel Sizes and the Number of Parallel Layers

Looking inside Internal Layers

Error Inspection

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Design and Investigation of Capsule Networks for Sentence Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Service-oriented model-based fault prediction and localization for service compositions testing using deep learning techniques
Roaa Elghondakly ... Nagwa Badr
Applied Soft Computing | VOL. 143
Roaa Elghondakly, et. al.Roaa Elghondakly ... Nagwa Badr
18 May 2023
Applied Soft Computing | VOL. 143

Tunnel boring machine vibration-based deep learning for the ground identification of working faces
Mengbo Liu ... Yanqing Men
Journal of Rock Mechanics and Geotechnical Engineering | VOL. 13
Mengbo Liu, et. al.Mengbo Liu ... Yanqing Men
01 Dec 2021
Journal of Rock Mechanics and Geotechnical Engineering | VOL. 13

A performance evaluation of convolutional and recurrent neural network on Philippine typhoon data
Justin Raz ... Christine Peña
-
Justin Raz, et. al.Justin Raz ... Christine Peña
01 Jan 2023
01 Jan 2023

Comparison of Tree-Structured Parzen Estimator Optimization in Three Typical Neural Network Models for Landslide Susceptibility Assessment
Guangzhi Rong ... Jiquan Zhang
Remote Sensing | VOL. 13
Guangzhi Rong, et. al.Guangzhi Rong ... Jiquan Zhang
20 Nov 2021
Remote Sensing | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design and Investigation of Capsule Networks for Sentence Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences