🤖 AI Summary
This work addresses the gap between existing multi-bit generative watermarking schemes for large language models and their theoretical performance limits under worst-case false positive constraints. By formulating the problem as a linear program, the authors propose two novel structured encoding–decoding schemes that provably achieve the theoretical lower bound on missed detection probability for finite text lengths, thereby attaining optimal watermark performance in this setting for the first time. The study not only uncovers the fundamental reasons behind the suboptimality of prior approaches but also systematically characterizes the practical trade-offs between the two proposed schemes, offering a complete solution to the optimal design of multi-bit generative watermarks.
📝 Abstract
This paper considers the problem of multi-bit generative watermarking for large language models under a worst-case false-alarm constraint. Prior work established a lower bound on the achievable miss-detection probability in the finite-token regime and proposed a scheme claimed to achieve this bound. We show, however, that the proposed scheme is in fact suboptimal. We then develop two new encoding-decoding constructions that attain the previously established lower bound, thereby completely characterizing the optimal multi-bit watermarking performance. Our approach formulates the watermark design problem as a linear program and derives the structural conditions under which optimality can be achieved. In addition, we identify the failure mechanism of the previous construction and compare the tradeoffs between the two proposed schemes.