Towards the Resistance of Neural Network Watermarking to Fine-tuning

📅 2025-05-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the vulnerability of deep neural network watermarks to fine-tuning, proposing a fine-tuning-resilient watermarking method operating in the frequency domain. The core insight is a theoretical proof that low-frequency Fourier components of convolutional kernels exhibit invariance and equivariance under fine-tuning, weight scaling, and permutation. Leveraging this property, we design a modified discrete Fourier transform to extract robust frequency-domain features and embed all watermark information exclusively into the low-frequency coefficients. Our approach integrates theory-driven robustness analysis, frequency-domain watermark embedding, and detection. Experiments demonstrate that the proposed watermark achieves >98% detection accuracy across diverse fine-tuning scenarios—including transfer learning, LoRA, and full- or partial-parameter fine-tuning—while inducing <0.3% degradation in model accuracy, significantly outperforming state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
This paper proves a new watermarking method to embed the ownership information into a deep neural network (DNN), which is robust to fine-tuning. Specifically, we prove that when the input feature of a convolutional layer only contains low-frequency components, specific frequency components of the convolutional filter will not be changed by gradient descent during the fine-tuning process, where we propose a revised Fourier transform to extract frequency components from the convolutional filter. Additionally, we also prove that these frequency components are equivariant to weight scaling and weight permutations. In this way, we design a watermark module to encode the watermark information to specific frequency components in a convolutional filter. Preliminary experiments demonstrate the effectiveness of our method.
Problem

Research questions and friction points this paper is trying to address.

Develops robust DNN watermarking resistant to fine-tuning
Uses revised Fourier transform for frequency component extraction
Encodes watermarks in convolutional filter frequency components
Innovation

Methods, ideas, or system contributions that make the work stand out.

Robust watermarking via low-frequency feature constraints
Revised Fourier transform for frequency component extraction
Watermark encoding in equivariant convolutional filter frequencies
🔎 Similar Papers
No similar papers found.