Abstract
The increased quality and human-likeness of AI-generated texts have resulted in a rising demand for neural text detectors, i.e., software that determines whether a text was written by a human or generated by an AI. Such tools are often used in contexts where the use of AI is restricted or prohibited outright, e.g., in education. It is therefore important for the effectiveness of such tools that they are robust against deliberate attempts to hide the fact that a text was generated by an AI. In this article, we investigate a broad range of adversarial attacks on English texts against six different neural text detectors, including commercial and research tools. While the results show that no detector is completely invulnerable to adversarial attacks, the latest generation of commercial detectors proved to be very robust and was not significantly influenced by most of the evaluated attack strategies.