Program Analysis Guided LLM Agent for Proof-of-Concept Generation

📅 2026-04-08

📈 Citations: 0

✨ Influential: 0

career value

179K/year

🤖 AI Summary

Existing approaches to automatically generating reliable proof-of-concept (PoC) inputs that trigger vulnerabilities suffer from either low success rates or poor scalability. This work proposes a novel LLM-based agent framework that integrates lightweight static analysis with dynamic execution feedback—including sanitizer instrumentation and coverage tracking—to guide the generation of more effective PoCs. For the first time, it deeply incorporates rule-driven static analysis and dynamic runtime information into large language models. Experimental results demonstrate that the proposed method achieves a 132% improvement in success rate over the current state-of-the-art agent-based approach on PoC generation tasks, significantly enhancing both effectiveness and scalability.

Technology Category

Application Category

📝 Abstract

Software developers frequently receive vulnerability reports that require them to reproduce the vulnerability in a reliable manner by generating a proof-of-concept (PoC) input that triggers it. Given the source code for a software project and a specific code location for a potential vulnerability, automatically generating a PoC for the given vulnerability has been a challenging research problem. Symbolic execution and fuzzing techniques require expert guidance and manual steps and face scalability challenges for PoC generation. Although recent advances in LLMs have increased the level of automation and scalability, the success rate of PoC generation with LLMs remains quite low. In this paper, we present a novel approach called Program Analysis Guided proof of concept generation agENT (PAGENT) that is scalable and significantly improves the success rate of automated PoC generation compared to prior results. PAGENT integrates lightweight and rule-based static analysis phases for providing static analysis guidance and sanitizer-based profiling and coverage information for providing dynamic analysis guidance with a PoC generation agent. Our experiments demonstrate that the resulting hybrid approach significantly outperforms the prior top-performing agentic approach by 132% for the PoC generation task.

Problem

Research questions and friction points this paper is trying to address.

Proof-of-Concept generation

vulnerability reproduction

program analysis

automated PoC

software security

Innovation

Methods, ideas, or system contributions that make the work stand out.

Program Analysis

Proof-of-Concept Generation

LLM Agent