AutoVulnPHP: LLM-Powered Two-Stage PHP Vulnerability Detection and Automated Localization

📅 2026-01-07
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses long-standing challenges in PHP vulnerability detection, including insufficient semantic precision in static analysis, high overhead in dynamic analysis, coarse-grained localization, and the absence of large-scale datasets and unified toolchains. The authors propose an end-to-end framework that integrates AST-based structural analysis, data-flow augmentation, and pretrained code embeddings to generate vulnerability hypotheses, followed by syntax-guided tracing, chain-of-thought LLM reasoning, and causal consistency verification for fine-grained root cause localization. They introduce PHPVD, the first large-scale PHP vulnerability dataset, and develop a synergistic pipeline comprising SIFT-VulMiner, SAFE-VulMiner, and ISAL modules. Experiments demonstrate 99.7% accuracy and 99.5% F1-score on public benchmarks, along with an 81.0% localization success rate on PHPVD; real-world deployment uncovered 429 previously unknown vulnerabilities, 351 of which have been assigned CVE identifiers.

Technology Category

Application Category

📝 Abstract
PHP's dominance in web development is undermined by security challenges: static analysis lacks semantic depth, causing high false positives; dynamic analysis is computationally expensive; and automated vulnerability localization suffers from coarse granularity and imprecise context. Additionally, the absence of large-scale PHP vulnerability datasets and fragmented toolchains hinder real-world deployment. We present AutoVulnPHP, an end-to-end framework coupling two-stage vulnerability detection with fine-grained automated localization. SIFT-VulMiner (Structural Inference for Flaw Triage Vulnerability Miner) generates vulnerability hypotheses using AST structures enhanced with data flow. SAFE-VulMiner (Semantic Analysis for Flaw Evaluation Vulnerability Miner) verifies candidates through pretrained code encoder embeddings, eliminating false positives. ISAL (Incremental Sequence Analysis for Localization) pinpoints root causes via syntax-guided tracing, chain-of-thought LLM inference, and causal consistency checks to ensure precision. We contribute PHPVD, the first large-scale PHP vulnerability dataset with 26,614 files (5.2M LOC) across seven vulnerability types. On public benchmarks and PHPVD, AutoVulnPHP achieves 99.7% detection accuracy, 99.5% F1 score, and 81.0% localization rate. Deployed on real-world repositories, it discovered 429 previously unknown vulnerabilities, 351 assigned CVE identifiers, validating its practical effectiveness.
Problem

Research questions and friction points this paper is trying to address.

PHP vulnerability detection
automated vulnerability localization
false positives
large-scale dataset
toolchain fragmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-powered vulnerability detection
two-stage verification
fine-grained localization
PHP vulnerability dataset
syntax-guided tracing
🔎 Similar Papers
No similar papers found.
Z
Zhiqiang Wang
Beijing Electronic Science and Technology Institute, Beijing, China
Y
Yizhong Ding
Beijing Electronic Science and Technology Institute, Beijing, China
Z
Zilong Xiao
Beijing Electronic Science and Technology Institute, Beijing, China
J
Jinyu Lu
Beijing Electronic Science and Technology Institute, Beijing, China
Yan Jia
Yan Jia
Nankai University
IoT SecurityVulnerability DiscoverySystem SecurityNovel Attacks
Y
Yanjun Li
The 15th Research Institute of China Electronics Technology Group Corporation, Beijing, China