Abstract

Inference has long been a central problem for understanding and reasoning in artificial intelligence. In particular, natural language inference (NLI), which aims to predict whether a hypothesis sentence can be inferred from a premise sentence, has attracted considerable research attention. Most prior work relies on a simplistic association between the premise and hypothesis sentence pairs, which is insufficient for learning the complex relationships between them and fails to fully exploit local context information. Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks struggle to model long-term dependencies, and their architectures are considerably more complex than Convolutional Neural Networks (CNNs). To address the long-term dependency problem and to incorporate context into richer sentence representations, this article presents a general Self-Attentive Convolutional Neural Network (SACNN) for natural language inference and sentence pair modeling tasks. The proposed model uses CNNs to integrate mutual interactions between sentences, so that each sentence is represented with its counterpart taken into account. Moreover, the self-attention mechanism fully exploits context semantics and long-term dependencies within a sentence. Experimental results show that SACNN outperforms strong baselines, achieving an accuracy of 89.7% on the Stanford Natural Language Inference (SNLI) dataset.
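To make the general idea concrete, the sketch below shows a minimal self-attentive convolutional encoder for sentence pair classification in PyTorch. It is an illustrative assumption, not the authors' exact SACNN: the layer sizes, the max-pooling step, the matching features (concatenation, difference, element-wise product), and all class and parameter names are hypothetical choices standing in for the configuration described in the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SelfAttentiveConvEncoder(nn.Module):
    """Sentence encoder: 1D convolution for local context, then scaled
    dot-product self-attention for long-range dependencies (illustrative sketch)."""
    def __init__(self, vocab_size, emb_dim=300, hidden_dim=300, kernel_size=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        # Convolution captures local n-gram context around each token.
        self.conv = nn.Conv1d(emb_dim, hidden_dim, kernel_size, padding=kernel_size // 2)
        # Query/key/value projections for self-attention.
        self.q = nn.Linear(hidden_dim, hidden_dim)
        self.k = nn.Linear(hidden_dim, hidden_dim)
        self.v = nn.Linear(hidden_dim, hidden_dim)

    def forward(self, tokens):                      # tokens: (batch, seq_len)
        x = self.embed(tokens)                      # (batch, seq_len, emb_dim)
        h = F.relu(self.conv(x.transpose(1, 2)))    # (batch, hidden, seq_len)
        h = h.transpose(1, 2)                       # (batch, seq_len, hidden)
        # Each position attends over the whole sentence, supplying the
        # long-term dependencies the convolution alone does not capture.
        q, k, v = self.q(h), self.k(h), self.v(h)
        scores = q @ k.transpose(1, 2) / (h.size(-1) ** 0.5)
        h = torch.softmax(scores, dim=-1) @ v       # (batch, seq_len, hidden)
        return h.max(dim=1).values                  # max-pool to a sentence vector

class SACNNClassifier(nn.Module):
    """Pair classifier: encode premise and hypothesis, combine, predict label."""
    def __init__(self, vocab_size, hidden_dim=300, num_classes=3):
        super().__init__()
        self.encoder = SelfAttentiveConvEncoder(vocab_size, hidden_dim=hidden_dim)
        # Common NLI matching features: concatenation, difference, element-wise product.
        self.classifier = nn.Sequential(
            nn.Linear(4 * hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, num_classes),
        )

    def forward(self, premise, hypothesis):
        p = self.encoder(premise)
        h = self.encoder(hypothesis)
        feats = torch.cat([p, h, torch.abs(p - h), p * h], dim=-1)
        return self.classifier(feats)   # logits over entailment / neutral / contradiction
```

On SNLI-style data, the classifier would be trained with cross-entropy over the three labels; the interaction features shown here are a standard way to let the premise and hypothesis representations inform each other at the matching stage.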
