On the Consistency of GNN Explanations for Malware Detection

📅 2025-04-22

📈 Citations: 0

✨ Influential: 0

career value

221K/year

🤖 AI Summary

This work addresses two key challenges—interpretability and explanation consistency—in control-flow-graph (CFG)-based malware detection using graph neural networks (GNNs). To tackle them, we propose a rule-driven dynamic CFG construction method that integrates expert-crafted rules with autoencoder-derived node embeddings for feature generation. We further design RankFusion, a multi-explainer aggregation mechanism, and Greedy Edge-wise Composition (GEC), a subgraph extraction strategy, enabling the first systematic evaluation and enhancement of explanation consistency in GNN-based malware detection. Experiments demonstrate significant improvements over baselines in detection accuracy, explanation fidelity, and consistency metrics. RankFusion and GEC jointly enhance structural coherence and explanatory reliability. Our approach establishes verifiable explanation boundaries, advancing trustworthy GNN deployment in security-critical applications.

Technology Category

Application Category

📝 Abstract

Control Flow Graphs (CFGs) are critical for analyzing program execution and characterizing malware behavior. With the growing adoption of Graph Neural Networks (GNNs), CFG-based representations have proven highly effective for malware detection. This study proposes a novel framework that dynamically constructs CFGs and embeds node features using a hybrid approach combining rule-based encoding and autoencoder-based embedding. A GNN-based classifier is then constructed to detect malicious behavior from the resulting graph representations. To improve model interpretability, we apply state-of-the-art explainability techniques, including GNNExplainer, PGExplainer, and CaptumExplainer, the latter is utilized three attribution methods: Integrated Gradients, Guided Backpropagation, and Saliency. In addition, we introduce a novel aggregation method, called RankFusion, that integrates the outputs of the top-performing explainers to enhance the explanation quality. We also evaluate explanations using two subgraph extraction strategies, including the proposed Greedy Edge-wise Composition (GEC) method for improved structural coherence. A comprehensive evaluation using accuracy, fidelity, and consistency metrics demonstrates the effectiveness of the proposed framework in terms of accurate identification of malware samples and generating reliable and interpretable explanations.

Problem

Research questions and friction points this paper is trying to address.

Dynamic CFG construction for malware detection using GNNs

Improving GNN explainability with hybrid embedding and RankFusion

Evaluating explanation consistency via subgraph extraction strategies

Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid rule-based and autoencoder CFG embedding

RankFusion aggregates top explainers' outputs

Greedy Edge-wise Composition enhances subgraph coherence

🔎 Similar Papers

Explainable Artificial Intelligence (XAI) for Malware Analysis: A Survey of Techniques, Applications, and Open Challenges