A Model of Convolutional Neural Network Combined with External Knowledge to Measure the Question Similarity for Community Question Answering Systems

Van-Tu Nguyen,Ha-Nam Nguyen,Anh-Cuong Le

doi:10.18178/ijmlc.2021.11.3.1035

Abstract

Automatically determining similar questions and ranking the obtained questions according to their similarities to each input question is a very important task to any community Question Answering system (cQA). Various methods have applied for this task including conventional machine learning methods with feature extraction and some recent studies using deep learning methods. This paper addresses the problem of how to combine advantages of different methods into one unified model. Moreover, deep learning models are usually only effective for large data, while training data sets in cQA problems are often small, so the idea of integrating external knowledge into deep learning models for this cQA problem becomes more important. To this objective, we propose a neural network-based model which combines a Convolutional Neural Network (CNN) with features from other methods so that the deep learning model is enhanced with addtional knowledge sources. In our proposed model, the CNN component will learn the representation of two given questions, then combined additional features through a Multilayer Perceptron (MLP) to measure similarity between the two questions. We tested our proposed model on the SemEval 2016 task-3 data set and obtain better results in comparison with previous studies on the same task.

Highlights

Nowadays, many community Question Answering system (cQA) forums are becoming more and more popular and really useful such as StackOverflow1 and Quora2
It is a natural way that whenever a cQA system receives a question, it firstly determine whether similar questions have existed or not, and if yes the system prefers to show these related questionanswers contained in its database before waiting for new answers from other users
The main parts of this paper include: section III presents the Convolutional Neural Network (CNN) model for question representation and for measuring similarity between two questions; Section IV presents different external knowledge sources and how to gain them; Section V is the important part in which we show how to integrate the external knowledge features into the CNN model

Summary

INTRODUCTION

In this paper, we address the problem to utilize different methods and different information sources for improving the accuracy of measuring question similarity as well as ranking the similar questions with respect to an input question To this objective, we firstly based on CNN, a very successful deep learning model, to formulate the problem of measuring the similarity between two questions. Various kinds of additional information have been used including word2vec representation which represents a word as a vector of real numbers; linguistic features such as words and name entities; question types and question categories, which are obtained by classification. From the CNN component we generate the joint representation containing miscellaneous features In another way, we can imagine that this model is an effective way of enhancing a deep learning model by providing complimentary additional knowledge, especially in the case of lacking training data.

RELATED WORK

MODELING CNN FOR QUESTION SIMILARITY MEASUREMENT

EXTERNAL KNOWLEDGE

Conventional Features

Question Type

Word Embedding

Question Category

THE EXTENDED CNN MODEL

Dataset

Setup Model’s Experimental Configures

Results

CONCLUSION

Findings

CONFLICT OF INTEREST

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Machine Learning and Computing	Publication Date: May 1, 2021
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

A Model of Convolutional Neural Network Combined with External Knowledge to Measure the Question Similarity for Community Question Answering Systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Machine Learning and Computing

Lead the way for us

Similar Papers

Estimation and uncertainty analysis of groundwater quality parameters in a coastal aquifer under seawater intrusion: a comparative study of deep learning and classic machine learning methods.
Mehmet Taşan ... Sevda Taşan
Environmental Science and Pollution Research | VOL. 30
Mehmet Taşan, et. al.Mehmet Taşan ... Sevda Taşan
08 Aug 2022
Environmental Science and Pollution Research | VOL. 30

Interpreting convolutional neural network decision for earthquake detection with feature map visualization, backward optimization and layer-wise relevance propagation methods
Josipa Majstorović ... Piero Poli
Geophysical Journal International | VOL. 232
Josipa Majstorović, et. al.Josipa Majstorović ... Piero Poli
27 Sep 2022
Geophysical Journal International | VOL. 232

Auditory attention tracking states in a cocktail party environment can be decoded by deep convolutional neural networks
Yin Tian ... Liang Ma
Journal of Neural Engineering | VOL. 17
Yin Tian, et. al.Yin Tian ... Liang Ma
01 Jun 2020
Journal of Neural Engineering | VOL. 17

Development of hybrid models based on deep learning and optimized machine learning algorithms for brain tumor Multi-Classification
Muhammed Celik ... Ozkan Inik
Expert Systems with Applications | VOL. 238
Muhammed Celik, et. al.Muhammed Celik ... Ozkan Inik
18 Oct 2023
Expert Systems with Applications | VOL. 238

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Model of Convolutional Neural Network Combined with External Knowledge to Measure the Question Similarity for Community Question Answering Systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Machine Learning and Computing