DualGuard: Dual-stream Large Language Model Watermarking Defense against Paraphrase and Spoofing Attack

๐Ÿ“… 2025-12-18
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address the challenge that existing large language model (LLM) watermarking techniques struggle to simultaneously defend against paraphrasing attacks and parasitic forgery attacks, this paper proposes DualGuardโ€”the first semantic-adaptive dual-stream watermarking framework unifying defense against both attack types. Methodologically, DualGuard introduces a semantics-driven dual-stream embedding mechanism that dynamically injects complementary watermark signals, enabling robust detection and traceable provenance attribution. It further incorporates adversarial robustness optimization and adaptive watermark injection to preserve textual quality (BLEU degradation < 0.8) while achieving >98% detection accuracy across multiple LLMs and benchmark datasets. The framework significantly enhances watermark reliability and attribution trustworthiness, setting a new standard for practical, attack-resilient LLM watermarking.

Technology Category

Application Category

๐Ÿ“ Abstract
With the rapid development of cloud-based services, large language models (LLMs) have become increasingly accessible through various web platforms. However, this accessibility has also led to growing risks of model abuse. LLM watermarking has emerged as an effective approach to mitigate such misuse and protect intellectual property. Existing watermarking algorithms, however, primarily focus on defending against paraphrase attacks while overlooking piggyback spoofing attacks, which can inject harmful content, compromise watermark reliability, and undermine trust in attribution. To address this limitation, we propose DualGuard, the first watermarking algorithm capable of defending against both paraphrase and spoofing attacks. DualGuard employs the adaptive dual-stream watermarking mechanism, in which two complementary watermark signals are dynamically injected based on the semantic content. This design enables DualGuard not only to detect but also to trace spoofing attacks, thereby ensuring reliable and trustworthy watermark detection. Extensive experiments conducted across multiple datasets and language models demonstrate that DualGuard achieves excellent detectability, robustness, traceability, and text quality, effectively advancing the state of LLM watermarking for real-world applications.
Problem

Research questions and friction points this paper is trying to address.

Defends against paraphrase and spoofing attacks in LLMs
Ensures reliable watermark detection and traceability
Protects intellectual property from model abuse
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-stream watermarking defends against both paraphrase and spoofing attacks
Adaptive dual-stream mechanism injects complementary signals based on semantics
Enables detection and tracing of spoofing attacks for reliable attribution
๐Ÿ”Ž Similar Papers
No similar papers found.
H
Hao Li
Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Y
Yubing Ren
Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Yanan Cao
Yanan Cao
Institute of Information Engineering, Chinese Academy of Sciences
Y
Yingjie Li
Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
F
Fang Fang
Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Shi Wang
Shi Wang
Institute of Computing Technology
knowledge graphnatural language processingneural-symbolic dual-process computing
L
Li Guo
Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China