Abstract

How to model a pair of sentences is a critical issue in many NLP tasks such as answer selection (AS), paraphrase identification (PI) and textual entailment (TE). Most prior work (i) deals with one individual task by fine-tuning a specific system; (ii) models each sentence’s representation separately, rarely considering the impact of the other sentence; or (iii) relies fully on manually designed, task-specific linguistic features. This work presents a general Attention Based Convolutional Neural Network (ABCNN) for modeling a pair of sentences. We make three contributions. (i) The ABCNN can be applied to a wide variety of tasks that require modeling of sentence pairs. (ii) We propose three attention schemes that integrate mutual influence between sentences into CNNs; thus, the representation of each sentence takes into consideration its counterpart. These interdependent sentence pair representations are more powerful than isolated sentence representations. (iii) ABCNNs achieve state-of-the-art performance on AS, PI and TE tasks. We release code at: https://github.com/yinwenpeng/Answer_Selection.

Highlights

  • How to model a pair of sentences is a critical issue in many NLP tasks such as answer selection (AS) (Yu et al., 2014; Feng et al., 2015), paraphrase identification (PI) (Madnani et al., 2012; Yin and Schütze, 2015a) and textual entailment (TE) (Marelli et al., 2014a; Bowman et al., 2015a), among others.

  • We introduce our basic Convolutional Neural Network (CNN), which is based on the Siamese architecture (Bromley et al., 1993), i.e., it consists of two weight-sharing CNNs, each processing one of the two sentences, and a final layer that solves the sentence pair task.

  • Comparing the Attention Based Convolutional Neural Network (ABCNN)-2 with the ABCNN-1, we find that the ABCNN-2 performs slightly better even though it is the simpler architecture.
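The Siamese baseline described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the paper's implementation: the filter bank, embedding dimensions and the cosine-similarity output layer are all hypothetical choices made for the sketch; the key point it demonstrates is that one shared set of convolution weights encodes both sentences, and max-pooling yields fixed-size representations that a final layer can compare.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv1d(sent, filt):
    """Slide one convolution filter of width w over the word-embedding
    matrix `sent` (length x dim) and apply a ReLU."""
    w = filt.shape[0]
    n = sent.shape[0] - w + 1
    out = np.array([np.sum(sent[i:i + w] * filt) for i in range(n)])
    return np.maximum(out, 0.0)

def encode(sent, filters):
    """CNN encoder: convolution + max-pooling per filter gives a
    fixed-size representation regardless of sentence length."""
    return np.array([conv1d(sent, f).max() for f in filters])

# Hypothetical inputs: two sentences as random word embeddings.
s0 = rng.normal(size=(7, 4))          # 7 words, 4-dim embeddings
s1 = rng.normal(size=(5, 4))          # 5 words, 4-dim embeddings
filters = rng.normal(size=(8, 3, 4))  # 8 shared filters of width 3

# Siamese property: the SAME filters encode both sentences.
r0, r1 = encode(s0, filters), encode(s1, filters)

# Final layer (a stand-in): cosine similarity of the representations.
score = r0 @ r1 / (np.linalg.norm(r0) * np.linalg.norm(r1) + 1e-8)
```

Because the weights are shared, the two encoders map semantically similar inputs to nearby representations, which is what makes the final comparison layer meaningful.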


Summary

Introduction

How to model a pair of sentences is a critical issue in many NLP tasks such as answer selection (AS) (Yu et al., 2014; Feng et al., 2015), paraphrase identification (PI) (Madnani et al., 2012; Yin and Schütze, 2015a) and textual entailment (TE) (Marelli et al., 2014a; Bowman et al., 2015a), among others. Most prior work derives each sentence’s representation separately, rarely considering the impact of the other sentence. This neglects the mutual influence of the two sentences in the context of the task, and it contradicts what humans do when comparing two sentences: human beings model the two sentences together, using the content of one sentence to guide the representation of the other.
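The mutual influence described above is what the attention schemes make explicit: an attention matrix scores every pair of units across the two sentences, and its row and column sums tell each CNN which parts of its input the other sentence cares about. The sketch below is a hedged illustration with random inputs; the match-score form 1/(1 + |x − y|), with |·| the Euclidean distance, follows the one proposed in the ABCNN paper, while the feature dimensions are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(1)

def attention_matrix(f0, f1):
    """A[i, j] scores how well unit i of sentence 0 matches unit j of
    sentence 1, using match-score(x, y) = 1 / (1 + |x - y|)."""
    diff = f0[:, None, :] - f1[None, :, :]       # pairwise differences
    return 1.0 / (1.0 + np.linalg.norm(diff, axis=2))

# Hypothetical feature maps for two sentences (units x feature dim).
f0 = rng.normal(size=(7, 4))
f1 = rng.normal(size=(5, 4))

A = attention_matrix(f0, f1)

# Row/column sums measure how strongly each unit of one sentence is
# attended to by the other; this signal can then condition convolution
# (as in ABCNN-1) or pooling (as in ABCNN-2).
attn0 = A.sum(axis=1)  # importance of each unit of sentence 0
attn1 = A.sum(axis=0)  # importance of each unit of sentence 1
```

Each sentence's representation is thus computed with its counterpart in view, rather than in isolation, mirroring the human comparison process the paragraph describes.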


