GraphFaaS: Serverless GNN Inference for Burst-Resilient, Real-Time Intrusion Detection

📅 2025-11-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional static Graph Neural Network (GNN) inference architectures fail to simultaneously achieve low latency and adaptability to bursty workloads in real-time provenance-graph-based intrusion detection. Method: This paper proposes the first serverless architecture tailored for GNN inference, decoupling the GNN inference pipeline and migrating it to a Function-as-a-Service (FaaS) environment. It integrates graph data sharding, fine-grained function-level parallelism, and dynamic elastic scaling to enable on-demand decoupling of computational resources from graph processing tasks. Contribution/Results: Experiments demonstrate that our approach reduces average detection latency by 85% and decreases the coefficient of latency variation by 64% compared to baseline methods. This significantly enhances both real-time responsiveness and system stability under dynamic network traffic conditions.

Technology Category

Application Category

📝 Abstract
Provenance-based intrusion detection is an increasingly popular application of graphical machine learning in cybersecurity, where system activities are modeled as provenance graphs to capture causality and correlations among potentially malicious actions. Graph Neural Networks (GNNs) have demonstrated strong performance in this setting. However, traditional statically-provisioned GNN inference architectures fall short in meeting two crucial demands of intrusion detection: (1) maintaining consistently low detection latency, and (2) handling highly irregular and bursty workloads. To holistically address these challenges, we present GraphFaaS, a serverless architecture tailored for GNN-based intrusion detection. GraphFaaS leverages the elasticity and agility of serverless computing to dynamically scale the GNN inference pipeline. We parallelize and adapt GNN workflows to a serverless environment, ensuring that the system can respond in real time to fluctuating workloads. By decoupling compute resources from static provisioning, GraphFaaS delivers stable inference latency, which is critical for dependable intrusion detection and timely incident response in cybersecurity operations. Preliminary evaluation shows GraphFaaS reduces average detection latency by 85% and coefficient of variation (CV) by 64% compared to the baseline.
Problem

Research questions and friction points this paper is trying to address.

Achieving stable low-latency GNN inference for intrusion detection
Handling bursty workloads in real-time cybersecurity systems
Overcoming static provisioning limitations in graph-based malware analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Serverless architecture for GNN inference
Dynamic scaling to handle bursty workloads
Decoupling compute from static provisioning
🔎 Similar Papers
No similar papers found.