Publicly-Detectable Watermarking for Language Models

Jaiden Fairoze,Sanjam Garg,Somesh Jha,Saeed Mahloujifar,Mohammad Mahmoody,Mingyuan Wang

doi:10.62056/ahmpdkp10

Publicly-Detectable Watermarking for Language Models

Jaiden Fairoze, Sanjam Garg + Show 4 more

https://doi.org/10.62056/ahmpdkp10

Copy DOI

Export

Save

Cite

Journal: IACR Communications in Cryptology	Publication Date: Jan 13, 2025
License type: CC BY 4.0

#Formal Claims #Text Output #Rejection Sampling #Watermarking Scheme #Secret Information #Detection Algorithm #Low Entropy #Cryptographic Signature

Abstract
Full-Text
Similar Papers

Abstract

Listen

We present a publicly-detectable watermarking scheme for LMs: the detection algorithm contains no secret information, and it is executable by anyone. We embed a publicly-verifiable cryptographic signature into LM output using rejection sampling and prove that this produces unforgeable and distortion-free (i.e., undetectable without access to the public key) text output. We make use of error-correction to overcome periods of low entropy, a barrier for all prior watermarking schemes. We implement our scheme and find that our formal claims are met in practice.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: IACR Communications in Cryptology

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Publicly-Detectable Watermarking for Language Models