Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy

Yu Fu,Deyi Xiong,Yue Dong

doi:10.1609/aaai.v38i16.29756

Abstract

To mitigate potential risks associated with language models (LMs), recent AI detection research proposes incorporating watermarks into machine-generated text through random vocabulary restrictions and utilizing this information for detection. In this paper, we show that watermarking algorithms designed for LMs cannot be seamlessly applied to conditional text generation (CTG) tasks without a notable decline in downstream task performance. To address this issue, we introduce a simple yet effective semantic-aware watermarking algorithm that considers the characteristics of conditional text generation with the input context. Compared to the baseline watermarks, our proposed watermark yields significant improvements in both automatic and human evaluations across various text generation models, including BART and Flan-T5, for CTG tasks such as summarization and data-to-text generation. Meanwhile, it maintains detection ability with higher z-scores but lower AUC scores, suggesting the presence of a detection paradox that poses additional challenges for watermarking CTG.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Topic-Guided Variational Auto-Encoder for Text Generation
Wenlin Wang ... Ruiyi Zhang
-
Wenlin Wang, et. al.Wenlin Wang ... Ruiyi Zhang
01 Jan 2019
01 Jan 2019

Conditional Text Generation for Harmonious Human-Machine Interaction
Bin Guo ... Wei Wu
ACM Transactions on Intelligent Systems and Technology | VOL. 12
Bin Guo, et. al.Bin Guo ... Wei Wu
26 Feb 2021
ACM Transactions on Intelligent Systems and Technology | VOL. 12

Automatic generation of sentimental texts via mixture adversarial networks
K Wang ... X Wan
Artificial Intelligence | VOL. 275
K Wang, et. al.K Wang ... X Wan
19 Jul 2019
Artificial Intelligence | VOL. 275

Data-to-Text Generation with Attention Recurrent Unit
Hechong Wang ... Zhiqiang Bai
-
Hechong Wang, et. al.Hechong Wang ... Zhiqiang Bai
01 Jul 2019
01 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence