ComMark: Covert and Robust Black-Box Model Watermarking with Compressed Samples

๐Ÿ“… 2025-12-16
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing black-box watermarking methods for deep learning models struggle to simultaneously achieve imperceptibility (resistance to detection and forgery) and robustness (resistance to removal). This paper proposes the first frequency-domain compression-based watermarking framework tailored for black-box settings: it embeds imperceptible and removal-resistant watermarked samples in the input space via DCT/DFT transformation, high-frequency filtering, and adversarial simulation training. A similarity-driven loss function is designed to jointly optimize both objectives. Our method is the first to unify and enhance both imperceptibility and robustness under black-box constraints, supporting cross-modal tasksโ€”including speech, text, generative modeling, and video. Extensive experiments across multiple datasets and model architectures demonstrate state-of-the-art performance: significantly improved imperceptibility and watermark detection rates exceeding 92% against prevalent removal attacks such as fine-tuning, knowledge distillation, and pruning.

Technology Category

Application Category

๐Ÿ“ Abstract
The rapid advancement of deep learning has turned models into highly valuable assets due to their reliance on massive data and costly training processes. However, these models are increasingly vulnerable to leakage and theft, highlighting the critical need for robust intellectual property protection. Model watermarking has emerged as an effective solution, with black-box watermarking gaining significant attention for its practicality and flexibility. Nonetheless, existing black-box methods often fail to better balance covertness (hiding the watermark to prevent detection and forgery) and robustness (ensuring the watermark resists removal)-two essential properties for real-world copyright verification. In this paper, we propose ComMark, a novel black-box model watermarking framework that leverages frequency-domain transformations to generate compressed, covert, and attack-resistant watermark samples by filtering out high-frequency information. To further enhance watermark robustness, our method incorporates simulated attack scenarios and a similarity loss during training. Comprehensive evaluations across diverse datasets and architectures demonstrate that ComMark achieves state-of-the-art performance in both covertness and robustness. Furthermore, we extend its applicability beyond image recognition to tasks including speech recognition, sentiment analysis, image generation, image captioning, and video recognition, underscoring its versatility and broad applicability.
Problem

Research questions and friction points this paper is trying to address.

Balances covertness and robustness in black-box model watermarking
Generates compressed, attack-resistant watermark samples using frequency-domain transformations
Extends watermarking applicability to various tasks beyond image recognition
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses frequency-domain transformations for compressed samples
Incorporates simulated attacks and similarity loss training
Extends applicability to multiple tasks beyond image recognition
๐Ÿ”Ž Similar Papers
No similar papers found.
Yunfei Yang
Yunfei Yang
Institute of Information Engineering, Chinese Academy of Sciences
AI SecurityModel ExtractionModel WatermarkingLarge Model Security
X
Xiaojun Chen
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Z
Zhendong Zhao
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Y
Yu Zhou
College of Computer Science, Nankai University, Tianjin, China
X
Xiaoyan Gu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Juan Cao
Juan Cao
Professor of Mathematics, Xiamen University
Computer Aided Geometric DesignComputer Graphics