Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-Hoc Retrieval

Weihang Su, Yiqun Liu, Jia Chen, Xiaolong Wu, Qingyao Ai, Shengluan Hou, Xiangsheng Li

doi:10.1609/aaai.v38i17.29869

Abstract

With the development of deep learning and natural language processing techniques, pre-trained language models have been widely used to solve information retrieval (IR) problems. Benefiting from the pre-training and fine-tuning paradigm, these models achieve state-of-the-art performance. In previous work, plain text from Wikipedia has been widely used in the pre-training stage. However, the rich structured information in Wikipedia, such as titles, abstracts, the hierarchical heading (multi-level title) structure, relationships between articles, references, hyperlink structures, and the way articles are organized, has not been fully explored. In this paper, we devise four pre-training objectives tailored for IR tasks based on the structured knowledge of Wikipedia. Compared to existing pre-training methods, our approach can better capture the semantic knowledge in the training corpus by leveraging the human-edited structured data from Wikipedia. Experimental results on multiple IR benchmark datasets show the superior performance of our model in both zero-shot and fine-tuning settings compared to existing strong retrieval baselines. In addition, experimental results in the biomedical and legal domains demonstrate that our approach achieves better performance in vertical domains than previous models, especially in scenarios where long-text similarity matching is needed. The code is available at https://github.com/oneal2000/Wikiformer.
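
The abstract does not spell out the four objectives, so the following is only a minimal illustration of the general idea of mining supervision from Wikipedia's heading hierarchy. It is not the authors' method. It assumes a hypothetical `Section` tree and a hypothetical helper `heading_path_pairs` that treats the chain of headings leading to a section as a pseudo-query and the section body as its matching passage.

```python
from dataclasses import dataclass, field


@dataclass
class Section:
    """Hypothetical node of an article's heading tree (not from the paper)."""
    heading: str
    text: str = ""
    children: list["Section"] = field(default_factory=list)


def heading_path_pairs(section, path=()):
    """Yield (pseudo_query, passage) pairs by walking the heading tree.

    The pseudo-query is the space-joined chain of headings from the article
    title down to the current section. Sections without body text are
    skipped but still contribute their heading to deeper paths.
    """
    current = path + (section.heading,)
    if section.text:
        yield " ".join(current), section.text
    for child in section.children:
        yield from heading_path_pairs(child, current)


# Toy article used purely for illustration.
article = Section(
    "Apple",
    "An apple is a fruit.",
    [
        Section("Cultivation", "", [Section("Pests", "Aphids attack apple trees.")]),
        Section("Uses", "Apples are eaten raw."),
    ],
)

pairs = list(heading_path_pairs(article))
for query, passage in pairs:
    print(f"{query!r} -> {passage!r}")
```

Pairs of this kind could, in principle, feed a contrastive or ranking loss during pre-training. Which signals the paper actually combines, and how, is described in the paper and the linked repository rather than here.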

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Similar Papers

Developing and Analyzing Deep Learning and Natural Language Processing Systems in the Context of Medical Information Processing
Emmanuel, Victoria Nkemjika ... Ogbonna Tochukwu Loveday
International Journal of Research and Innovation in Applied Science | VOL. 9
01 Jan 2024

Uncovering Semantic Inconsistencies and Deceptive Language in False News Using Deep Learning and NLP Techniques for Effective Management
Yash Chopra ... Saurabh Pratap Singh Rathore
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11
18 Aug 2023

Deep Learning and Natural Language Processing Technology Based Display and Analysis of Modern Artwork
Xiongfei Li, Yongjun Li
Journal of Electrical Systems | VOL. 20
04 Apr 2024

Depression Detection in Tweets from Urban Cities of Malaysia using Deep Learning
E.K Priya Sri ... Maryam Zaffar
25 Oct 2021