Transformer-Based Neural Network for Answer Selection in Question Answering

Taihua Shao,Honghui Chen,Zepeng Hao,Yupu Guo

doi:10.1109/access.2019.2900753

Taihua Shao, Honghui Chen + Show 2 more

Open Access

https://doi.org/10.1109/access.2019.2900753

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 106	License type: cc-by-nc-nd

Affiliation: National University of Defense Technology

Abstract

Answer selection is a crucial subtask in the question answering (QA) system. Conventional avenues for this task mainly concentrate on developing linguistic tools that are limited in both performance and practicability. Answer selection approaches based on deep learning have been well investigated with the tremendous success of deep learning in natural language processing. However, the traditional neural networks employed in existing answer selection models, i.e., recursive neural network or convolutional neural network, typically suffer from obtaining the global text information due to their operating mechanisms. The recent Transformer neural network is considered to be good at extracting the global information by employing only self-attention mechanism. Thus, in this paper, we design a Transformer-based neural network for answer selection, where we deploy a bidirectional long short-term memory (BiLSTM) behind the Transformer to acquire both global information and sequential features in the question or answer sentence. Different from the original Transformer, our Transformer-based network focuses on sentence embedding rather than the seq2seq task. In addition, we employ a BiLSTM rather than utilizing the position encoding to incorporate sequential features as the universal Transformer does. Furthermore, we apply three aggregated strategies to generate sentence embeddings for question and answer, i.e., the weighted mean pooling, the max pooling, and the attentive pooling, leading to three corresponding Transformer-based models, i.e., QA-TF $_{{W\!P}}$ , QA-TF $_{{M\!P}}$ , and QA-TF $_{{A\!P}}$ , respectively. Finally, we evaluate our proposals on a popular QA dataset WikiQA. The experimental results demonstrate that our proposed Transformer-based answer selection models can produce a better performance compared with several competitive baselines. In detail, our best model outperforms the state-of-the-art baseline by up to 2.37%, 2.83%, and 3.79% in terms of MAP, MRR, and accuracy, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Transformer-Based Neural Network for Answer Selection in Question Answering

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Refined Answer Selection Method with Attentive Bidirectional Long Short-Term Memory Network and Self-Attention Mechanism for Intelligent Medical Service Robot
Deguang Wang ... Ye Liang
Applied Sciences | VOL. 13
Deguang Wang, et. al.Deguang Wang ... Ye Liang
26 Feb 2023
Applied Sciences | VOL. 13

Representation Learning and Learning from Limited Labeled Data for Community Question Answering

-

01 Jan 2020
01 Jan 2020

Collaborative Learning for Answer Selection in Question Answering
Taihua Shao ... Pengfei Zhang
IEEE Access | VOL. 7
Taihua Shao, et. al.Taihua Shao ... Pengfei Zhang
01 Jan 2019
IEEE Access | VOL. 7

Double attention recurrent convolution neural network for answer selection.
Ganchao Bao ... Hongli Zhang
Royal Society open science | VOL. 7
Ganchao Bao, et. al.Ganchao Bao ... Hongli Zhang
01 May 2020
Royal Society open science | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transformer-Based Neural Network for Answer Selection in Question Answering

Abstract

Talk to us

Similar Papers

More From: IEEE Access