๐ค AI Summary
Existing black-box watermarking methods for deep learning models struggle to simultaneously achieve imperceptibility (resistance to detection and forgery) and robustness (resistance to removal). This paper proposes the first frequency-domain compression-based watermarking framework tailored for black-box settings: it embeds imperceptible and removal-resistant watermarked samples in the input space via DCT/DFT transformation, high-frequency filtering, and adversarial simulation training. A similarity-driven loss function is designed to jointly optimize both objectives. Our method is the first to unify and enhance both imperceptibility and robustness under black-box constraints, supporting cross-modal tasksโincluding speech, text, generative modeling, and video. Extensive experiments across multiple datasets and model architectures demonstrate state-of-the-art performance: significantly improved imperceptibility and watermark detection rates exceeding 92% against prevalent removal attacks such as fine-tuning, knowledge distillation, and pruning.
๐ Abstract
The rapid advancement of deep learning has turned models into highly valuable assets due to their reliance on massive data and costly training processes. However, these models are increasingly vulnerable to leakage and theft, highlighting the critical need for robust intellectual property protection. Model watermarking has emerged as an effective solution, with black-box watermarking gaining significant attention for its practicality and flexibility. Nonetheless, existing black-box methods often fail to better balance covertness (hiding the watermark to prevent detection and forgery) and robustness (ensuring the watermark resists removal)-two essential properties for real-world copyright verification. In this paper, we propose ComMark, a novel black-box model watermarking framework that leverages frequency-domain transformations to generate compressed, covert, and attack-resistant watermark samples by filtering out high-frequency information. To further enhance watermark robustness, our method incorporates simulated attack scenarios and a similarity loss during training. Comprehensive evaluations across diverse datasets and architectures demonstrate that ComMark achieves state-of-the-art performance in both covertness and robustness. Furthermore, we extend its applicability beyond image recognition to tasks including speech recognition, sentiment analysis, image generation, image captioning, and video recognition, underscoring its versatility and broad applicability.