NeuPerm: Disrupting Malware Hidden in Neural Network Parameters by Leveraging Permutation Symmetry

📅 2025-10-23

📈 Citations: 0

✨ Influential: 0

career value

233K/year

🤖 AI Summary

Pre-trained model sharing faces emerging threats of parameter-level malware injection: adversaries can embed self-executing malicious code directly into neural network weights, enabling stealthy execution upon model deployment. This paper proposes the first defense mechanism leveraging neural network permutation symmetry—introducing controlled weight reordering perturbations to disrupt the structural integrity of embedded malware, without requiring quantization or fine-tuning. The method preserves model accuracy with negligible degradation (<0.5% drop) while completely neutralizing state-of-the-art parameter-level malware attacks across diverse architectures, including ResNet, ViT, and LLaMA-2. Its core contribution lies in the principled translation of permutation symmetry theory into a practical, lightweight, and architecture-agnostic defense—demonstrating theoretical rigor, deployment efficiency, and broad applicability across vision and language models.

Technology Category

Application Category

📝 Abstract

Pretrained deep learning model sharing holds tremendous value for researchers and enterprises alike. It allows them to apply deep learning by fine-tuning models at a fraction of the cost of training a brand-new model. However, model sharing exposes end-users to cyber threats that leverage the models for malicious purposes. Attackers can use model sharing by hiding self-executing malware inside neural network parameters and then distributing them for unsuspecting users to unknowingly directly execute them, or indirectly as a dependency in another software. In this work, we propose NeuPerm, a simple yet effec- tive way of disrupting such malware by leveraging the theoretical property of neural network permutation symmetry. Our method has little to no effect on model performance at all, and we empirically show it successfully disrupts state-of-the-art attacks that were only previously addressed using quantization, a highly complex process. NeuPerm is shown to work on LLMs, a feat that no other previous similar works have achieved. The source code is available at https://github.com/danigil/NeuPerm.git.

Problem

Research questions and friction points this paper is trying to address.

Disrupting malware hidden in neural network parameters

Addressing cyber threats from pretrained model sharing

Preventing execution of malicious code in neural networks

Innovation

Methods, ideas, or system contributions that make the work stand out.

Disrupts malware using neural network permutation symmetry

Preserves model performance while neutralizing hidden threats

Successfully protects large language models from attacks

🔎 Similar Papers

Unified Neural Backdoor Removal with Only Few Clean Samples through Unlearning and Relearning