Towards the Resistance of Neural Network Watermarking to Fine-tuning

📅 2025-05-02

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

This paper addresses the vulnerability of deep neural network watermarks to fine-tuning, proposing a fine-tuning-resilient watermarking method operating in the frequency domain. The core insight is a theoretical proof that low-frequency Fourier components of convolutional kernels exhibit invariance and equivariance under fine-tuning, weight scaling, and permutation. Leveraging this property, we design a modified discrete Fourier transform to extract robust frequency-domain features and embed all watermark information exclusively into the low-frequency coefficients. Our approach integrates theory-driven robustness analysis, frequency-domain watermark embedding, and detection. Experiments demonstrate that the proposed watermark achieves >98% detection accuracy across diverse fine-tuning scenarios—including transfer learning, LoRA, and full- or partial-parameter fine-tuning—while inducing <0.3% degradation in model accuracy, significantly outperforming state-of-the-art methods.

Technology Category

Application Category

📝 Abstract

This paper proves a new watermarking method to embed the ownership information into a deep neural network (DNN), which is robust to fine-tuning. Specifically, we prove that when the input feature of a convolutional layer only contains low-frequency components, specific frequency components of the convolutional filter will not be changed by gradient descent during the fine-tuning process, where we propose a revised Fourier transform to extract frequency components from the convolutional filter. Additionally, we also prove that these frequency components are equivariant to weight scaling and weight permutations. In this way, we design a watermark module to encode the watermark information to specific frequency components in a convolutional filter. Preliminary experiments demonstrate the effectiveness of our method.

Problem

Research questions and friction points this paper is trying to address.

Develops robust DNN watermarking resistant to fine-tuning

Uses revised Fourier transform for frequency component extraction

Encodes watermarks in convolutional filter frequency components

Innovation

Methods, ideas, or system contributions that make the work stand out.

Robust watermarking via low-frequency feature constraints

Revised Fourier transform for frequency component extraction

Watermark encoding in equivariant convolutional filter frequencies

🔎 Similar Papers

No similar papers found.