π€ AI Summary
This work addresses the challenge that existing text watermarking methods struggle to simultaneously preserve generation quality and ensure sufficient watermark strength when embedding long messages, thereby limiting traceability. To overcome this, the authors propose a distortion-free multi-bit watermarking framework that enhances watermark signals through multi-channel coloring reweighting and multi-layer sequence reweighting mechanisms, all while strictly maintaining the unbiased token distribution of the language modelβs output. Additionally, an evidence accumulation detector is designed to enable high-precision decoding. The method supports high-fidelity embedding and recovery of long messages, achieving near-perfect decoding accuracy for short messages and improving long-message detection performance by nearly 30% over the next-best approach, all without compromising text quality.
π Abstract
Large language models now produce text indistinguishable from human writing, which increases the need for reliable provenance tracing. Multi-bit watermarking can embed identifiers into generated text, but existing methods struggle to keep both text quality and watermark strength while carrying long messages. We propose MC$^2$Mark, a distortion-free multi-bit watermarking framework designed for reliable embedding and decoding of long messages. Our key technical idea is Multi-Channel Colored Reweighting, which encodes bits through structured token reweighting while keeping the token distribution unbiased, together with Multi-Layer Sequential Reweighting to strengthen the watermark signal and an evidence-accumulation detector for message recovery. Experiments show that MC$^2$Mark improves detectability and robustness over prior multi-bit watermarking methods while preserving generation quality, achieving near-perfect accuracy for short messages and exceeding the second-best method by nearly 30% for long messages.