Tarallo: Evading Behavioral Malware Detectors in the Problem Space

📅 2025-06-03

🏛️ International Conference on Detection of intrusions and malware, and vulnerability assessment

📈 Citations: 0

✨ Influential: 0

career value

217K/year

🤖 AI Summary

Behavioral malware detectors are vulnerable to adversarial attacks in both feature and problem spaces and struggle with malware’s non-deterministic execution. To address this, we propose the first end-to-end adversarial framework explicitly accounting for execution stochasticity. Our method introduces: (1) PS-FGSM, a novel gradient-based perturbation algorithm tailored for sequential behavioral features; (2) a sandbox-guided, code-level problem-space modification strategy ensuring consistent evasion across multiple executions; and (3) a unified white-box/black-box attack architecture leveraging RNN-based dynamic behavioral modeling. Evaluated on two state-of-the-art RNN-based detectors, our approach achieves a 99% attack success rate while significantly reducing required code modifications. It outperforms existing methods substantially in both white-box and black-box settings, demonstrating superior robustness, efficiency, and cross-execution consistency.

Technology Category

Application Category

📝 Abstract

Machine learning algorithms can effectively classify malware through dynamic behavior but are susceptible to adversarial attacks. Existing attacks, however, often fail to find an effective solution in both the feature and problem spaces. This issue arises from not addressing the intrinsic nondeterministic nature of malware, namely executing the same sample multiple times may yield significantly different behaviors. Hence, the perturbations computed for a specific behavior may be ineffective for others observed in subsequent executions. In this paper, we show how an attacker can augment their chance of success by leveraging a new and more efficient feature space algorithm for sequential data, which we have named PS-FGSM, and by adopting two problem space strategies specially tailored to address nondeterminism in the problem space. We implement our novel algorithm and attack strategies in Tarallo, an end-to-end adversarial framework that significantly outperforms previous works in both white and black-box scenarios. Our preliminary analysis in a sandboxed environment and against two RNN-based malware detectors, shows that Tarallo achieves a success rate up to 99% on both feature and problem space attacks while significantly minimizing the number of modifications required for misclassification.

Problem

Research questions and friction points this paper is trying to address.

Overcoming nondeterministic malware behavior in adversarial attacks

Enhancing attack success with efficient feature space algorithm PS-FGSM

Achieving high success rate in white and black-box scenarios

Innovation

Methods, ideas, or system contributions that make the work stand out.

PS-FGSM algorithm for sequential data

Problem space strategies for nondeterminism

Tarallo framework for adversarial attacks

🔎 Similar Papers

Minerva: A File-Based Ransomware Detector