Abstract

We propose a novel end-to-end approach, namely, the semantic-containing double-level embedding Bi-LSTM model (SCDE-Bi-LSTM), to solve the three key problems of Q&A matching in the Chinese medical field. In the similarity calculation of the Q&A core module, we propose a text similarity calculation method that contains semantic information, to solve the problem that previous Q&A methods do not incorporate the deep information of a sentence into the similarity calculations. For the sentence vector representation module, we present a double-level embedding sentence representation method to reduce the error caused by Chinese medical word segmentation. In addition, due to the problem of the attention mechanism tending to cause backward deviation of the features, we propose an improved algorithm based on Bi-LSTM in the feature extraction stage. The Q&A framework proposed in this paper not only retains important timing features but also loses low-frequency features and noise. Additionally, it is applicable to different domains. To verify the framework, extensive Chinese medical Q&A corpora are created. We run several state-of-the-art Q&A methods as contrastive experiments on the medical corpora and the current popular insuranceQA dataset under different performance measures. The experimental results on the medical corpora show that our framework significantly outperforms several strong baselines and achieves an improvement of top-1 accuracy of up to 14%, reaching 79.15%.

Highlights

  • Question answer selection is a subdirection in the field of question answering systems

  • With the development of natural language processing, question answering systems have been at the forefront of artificial intelligence research

  • The question answering system is divided into task robots, chat robots, and solution robots according to different application fields

Read more

Summary

Introduction

Question answer selection is a subdirection in the field of question answering systems. The Chinese medical question-and-answer matching is performed by a solution robot. YouWenBiDa (http:// www.120ask.com) provides an extensive medical Q&A information database for patients. Based on these massive resources, many meaningful studies related to medical Q&A are being carried out. We propose an SCDE-Bi-LSTM algorithm that greatly improves the accuracy of Q&A matching in the Chinese medical field. E data resources used throughout this paper are available on GitHub (https://github.com/Vitas-Xiong/ Chinese-Medical-Question-Answering-System), including the text of the original Q&A statements. To more accurately match the correct answer (a+) of q from the answer pool, the improved SCDE-Bi-LSTM is proposed to select the best answer in the Chinese medical Q&A corpora. It is recommended to treat it by taking Sishen Wan.)

Related Work
Methodology
Objective function
Result
Experiments and Results
Attentive pooling
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call