PRADA: Practical Black-box Adversarial Attacks against Neural Ranking Models

Chen Wu,Maarten De Rijke,Jiafeng Guo,Xueqi Cheng,Yixing Fan,Ruqing Zhang

doi:10.1145/3576923

Abstract

Neural ranking models (NRMs) have shown remarkable success in recent years, especially with pre-trained language models. However, deep neural models are notorious for their vulnerability to adversarial examples. Adversarial attacks may become a new type of web spamming technique given our increased reliance on neural information retrieval models. Therefore, it is important to study potential adversarial attacks to identify vulnerabilities of NRMs before they are deployed. In this article, we introduce the Word Substitution Ranking Attack (WSRA) task against NRMs, which aims at promoting a target document in rankings by adding adversarial perturbations to its text. We focus on the decision-based black-box attack setting, where the attackers cannot directly get access to the model information, but can only query the target model to obtain the rank positions of the partial retrieved list. This attack setting is realistic in real-world search engines. We propose a novel Pseudo Relevance-based ADversarial ranking Attack method (PRADA) that learns a surrogate model based on Pseudo Relevance Feedback (PRF) to generate gradients for finding the adversarial perturbations. Experiments on two web search benchmark datasets show that PRADA can outperform existing attack strategies and successfully fool the NRM with small indiscernible perturbations of text.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PRADA: Practical Black-box Adversarial Attacks against Neural Ranking Models

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Information Systems

Lead the way for us

Journal: ACM Transactions on Information Systems	Publication Date: Apr 8, 2023
Citations: 14

Similar Papers

Rethinking Textual Adversarial Defense for Pre-Trained Language Models
Jiayi Wang ... Rongzhou Bao
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Jiayi Wang, et. al.Jiayi Wang ... Rongzhou Bao
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

Lambertian-based adversarial attacks on deep-learning-based underwater side-scan sonar image classification
Qixiang Ma ... Wenxue Yu
Pattern Recognition | VOL. 138
Qixiang Ma, et. al.Qixiang Ma ... Wenxue Yu
08 Feb 2023
Pattern Recognition | VOL. 138

Fooling deep neural detection networks with adaptive object-oriented adversarial perturbation
Yatie Xiao ... Bo Liu
Pattern Recognition | VOL. 115
Yatie Xiao, et. al.Yatie Xiao ... Bo Liu
20 Feb 2021
Pattern Recognition | VOL. 115

Adversarial example generation with adaptive gradient search for single and ensemble deep neural network
Yatie Xiao ... Bo Liu
Information Sciences | VOL. 528
Yatie Xiao, et. al.Yatie Xiao ... Bo Liu
14 Apr 2020
Information Sciences | VOL. 528

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PRADA: Practical Black-box Adversarial Attacks against Neural Ranking Models

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Information Systems