dgMARK: Decoding-Guided Watermarking for Diffusion Language Models

📅 2026-01-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the lack of effective watermarking mechanisms for discrete diffusion language models (dLLMs) by proposing a plug-and-play text watermarking method that requires no modification to the model’s probability distribution. The approach leverages, for the first time, the sensitivity of dLLMs to demasking order as a watermark channel, embedding high-reward tokens that satisfy binary hash parity constraints through guided decoding sequences, augmented with a one-step lookahead strategy. During detection, a sliding-window statistical test enables robust watermark identification while remaining compatible with mainstream decoding strategies—including confidence-, entropy-, and margin-based ranking—and effectively resists common post-editing attacks such as insertion, deletion, substitution, and paraphrasing. Experimental results demonstrate that the method achieves high watermark robustness without compromising text generation quality, making it suitable for practical deployment.

Technology Category

Application Category

📝 Abstract
We propose dgMARK, a decoding-guided watermarking method for discrete diffusion language models (dLLMs). Unlike autoregressive models, dLLMs can generate tokens in arbitrary order. While an ideal conditional predictor would be invariant to this order, practical dLLMs exhibit strong sensitivity to the unmasking order, creating a new channel for watermarking. dgMARK steers the unmasking order toward positions whose high-reward candidate tokens satisfy a simple parity constraint induced by a binary hash, without explicitly reweighting the model's learned probabilities. The method is plug-and-play with common decoding strategies (e.g., confidence, entropy, and margin-based ordering) and can be strengthened with a one-step lookahead variant. Watermarks are detected via elevated parity-matching statistics, and a sliding-window detector ensures robustness under post-editing operations including insertion, deletion, substitution, and paraphrasing.
Problem

Research questions and friction points this paper is trying to address.

watermarking
diffusion language models
unmasking order
text generation
content attribution
Innovation

Methods, ideas, or system contributions that make the work stand out.

decoding-guided watermarking
diffusion language models
unmasking order
parity constraint
robust detection
🔎 Similar Papers
No similar papers found.