Abstract

Community Question Answering (CQA) forums, such as Stack Overflow, Stack Exchange and Massive Open Online Course (MOOC) forums, spend a lot of manpower and time to manage duplicate questions on the forum. Mismatch of duplicate questions makes users keep asking “new” questions, and the continuous accumulation of duplicate questions may interfere with their information searching again, affecting user satisfaction. Neural Networks (NN) models for parsing semantics provide the possibility of end-to-end duplicate question detection. Whereas, due to lack of domain data and expertise, NN models for semantic parsing are rarely directly applied to CQA duplicate question detection. This paper proposes a Semantic Matching Model (SMM) integrated with the multi-task transfer learning framework for multi-domain forum duplicate question detection. By designing the word-to-sentence interaction mechanism based on the word-to-word interaction, SMM can automatically choose to ignore or pay attention to potential similar words according to the semantics at the sentence level. The experiments on the benchmark data set and MOOC forum data set state that SMM outperforms baselines, its interaction mechanism is effective and it has an advantage in cross-domain duplicate question detection.

Highlights

  • The Community Question Answering (CQA) forum plays an important role in helping users find solutions to their questions and promoting online learning

  • To solve the above problems, this paper proposed a Semantic Matching Model (SMM) with multi-task transfer learning for multi-domain forum duplicate question detection

  • On the basis of hCNN [18], this paper proposed SMM, and with the help of multi-task transfer learning, the model was well adapted to multi-domain duplicate question detection

Read more

Summary

INTRODUCTION

The CQA forum plays an important role in helping users find solutions to their questions and promoting online learning. To solve the above problems, this paper proposed a Semantic Matching Model (SMM) with multi-task transfer learning for multi-domain forum duplicate question detection. The experimental results show that SMM can effectively extract the interactive relationships between sentences and it is well suited for duplicate question detection in small domains. The rest of this paper is organized as follows: Part II introduces the related work on duplicate question detection and semantic matching; Part III introduces the multi-task framework and the SMM; Part IV reports experimental settings and analyzes the results; The Part V summarizes

RELATED WORK
METHODS
SEMANTIC MATCHING MAODEL
EXPERIMENTS
Findings
CONCLUSION AND FUTURE WORK
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.