Abstract

Pre-trained language model-based neural rankers have been widely applied in information retrieval (IR). However, the robustness of current IR models has received insufficient attention, even though it can significantly affect the user experience in practical applications. In this study, we focus on the ability of IR models to defend against query attacks while preserving their retrieval performance. We find that improving the robustness of IR models requires attention not only to model architecture and training methods but also to data quality. Unlike previous research, we use large language models (LLMs) to generate query variations that share the same intent, yielding richer and more realistic expressions while keeping the query intent consistent. Based on these LLM-generated query variations, we propose a novel contrastive training framework that substantially enhances the robustness of IR models to query perturbations. Specifically, we combine a contrastive loss in the representation space of query variations with the ranking loss during retrieval training, improving the model's ability to capture the underlying semantics of queries. Experimental results on two public datasets, WikiQA and ANTIQUE, demonstrate that the proposed contrastive training approach effectively improves model robustness under query attacks while outperforming baselines in retrieval performance. Compared with the best baseline approach, the improvements in average robustness performance of the Reranker IR models are 24.9%, 26.5%, 27.0%, and 75.0% on WikiQA and 8.7%, 1.9%, 6.3%, and 13.6% on ANTIQUE, in terms of MAP (Mean Average Precision), MRR (Mean Reciprocal Rank), nDCG@10 (Normalized Discounted Cumulative Gain), and P@10 (Precision), respectively.
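A minimal sketch of the combined objective described above, assuming an InfoNCE-style contrastive loss over query-variation embeddings and a pairwise hinge ranking loss. The specific loss forms, the mixing weight `lam`, and all function names are illustrative assumptions, not the paper's exact formulation:

```python
import numpy as np

def info_nce(anchor, positive, negatives, tau=0.1):
    """Contrastive (InfoNCE-style) loss: pull the embedding of a query
    variation (positive) toward the original query (anchor) and push
    unrelated queries (negatives) away in representation space."""
    def cos(a, b):
        return float(a @ b) / (np.linalg.norm(a) * np.linalg.norm(b))
    pos = np.exp(cos(anchor, positive) / tau)
    neg = sum(np.exp(cos(anchor, n) / tau) for n in negatives)
    return -np.log(pos / (pos + neg))

def hinge_rank(score_rel, score_irr, margin=1.0):
    """Pairwise ranking loss: a relevant document should outscore an
    irrelevant one by at least `margin`."""
    return max(0.0, margin - (score_rel - score_irr))

def combined_loss(anchor, positive, negatives, score_rel, score_irr, lam=0.5):
    """Weighted sum of the ranking loss and the contrastive loss;
    `lam` is a hypothetical trade-off hyperparameter."""
    return hinge_rank(score_rel, score_irr) + lam * info_nce(anchor, positive, negatives)
```

In this sketch, the ranking term trains the scorer on relevance judgments, while the contrastive term encourages query variations with the same intent to map to nearby representations, which is the mechanism the abstract credits for robustness to query perturbations.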
