Vul-LMGNNs: Fusing language models and online-distilled graph neural networks for code vulnerability detection

📅 2024-04-23
🏛️ Information Fusion
📈 Citations: 8
Influential: 0
📄 PDF
🤖 AI Summary
Existing code vulnerability detection methods face two key limitations: graph neural networks (GNNs) suffer from restricted receptive fields due to local neighborhood aggregation, hindering long-range structural dependency modeling; while code large language models (code LLMs) excel in semantic understanding, their synergy with GNNs remains underexplored. This paper proposes LLM-GNN, a novel collaborative framework that unifies LLM-driven semantic comprehension with lightweight, online-distilled GNN-based structural modeling in an end-to-end architecture. We introduce a dynamic graph structure online distillation mechanism tailored for vulnerability detection, enabling efficient knowledge transfer over fused abstract syntax tree (AST) and control flow graph (CFG) representations. Evaluated on multiple benchmarks, our method achieves an F1-score of 92.7%, outperforming state-of-the-art approaches by an average of 4.3 percentage points, while accelerating inference by 2.1× and significantly reducing deployment overhead.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Enhancing code vulnerability detection by fusing language models and graph neural networks
Overcoming limited structural information propagation in traditional GNN approaches
Improving collaboration between codeLMs and GNNs for semantic-structural analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fuses pre-trained codeLMs with GNNs
Uses online knowledge distillation mechanism
Implements implicit-explicit joint training
🔎 Similar Papers
No similar papers found.