MemRepair: Hierarchical Memory for Agentic Repository-Level Vulnerability Repair

📅 2026-05-17

📈 Citations: 0

✨ Influential: 0

career value

241K/year

🤖 AI Summary

This work addresses the limitation of existing large language model–based vulnerability repair approaches, which lack persistent memory mechanisms and thus struggle to reuse experience or learn from failures in complex, multi-file scenarios. To overcome this, the authors propose a hierarchical memory agent framework that models repair as an iterative, experience-driven optimization loop. The framework innovatively integrates three memory layers—History-Fix, Security-Pattern, and Refinement-Trajectory—to respectively enable cross-file repair convention reuse, extraction of security-critical patterns, and tracking of refinement trajectories from failure to success. Coupled with a memory-augmented architecture, retrieval mechanisms, and feedback-driven patch refinement, the method achieves repair success rates of 58.0%, 58.2%, and 30.58% on SEC-Bench, PatchEval, and Multi-SWE-bench (C++), significantly outperforming baselines such as OpenHands, SWE-agent, and InfCode-C++.

📝 Abstract

Modern software ecosystems face a rapidly growing number of disclosed vulnerabilities, increasing the need for automated repair techniques that can operate reliably at repository scale. Although Large Language Model (LLM)-based agents have recently shown promise for automated vulnerability repair (AVR), most existing systems still treat repair as a single generation step over the currently visible code context. As a result, they lack a persistent mechanism for reusing prior fixes or learning from failed validation attempts, which limits their effectiveness on complex, multi-file repair tasks. We present MemRepair, a memory-augmented agentic framework that formulates vulnerability repair as an iterative, experience-driven process. MemRepair combines three complementary memory layers, i.e., History-Fix, Security-Pattern, and Refinement-Trajectory memories, with a dynamic feedback-driven refinement loop. This design allows the agent to retrieve repository-specific repair conventions, apply reusable security defenses, and exploit prior "failure-to-success" trajectories to revise semantically invalid patches based on runtime evidence. We evaluate MemRepair on three representative repository-level vulnerability repair benchmarks: SEC-Bench, PatchEval (Python, Go, JavaScript), and the C++ subset of Multi-SWE-bench. MemRepair achieves state-of-the-art resolution rates of 58.0%, 58.2%, and 30.58%, respectively, outperforming strong general-purpose agents such as OpenHands and SWE-agent, as well as the specialized AVR tool InfCode-C++, while maintaining competitive repair cost. These results show that persistent, hierarchical repair memory can substantially improve the reliability of agentic vulnerability repair across diverse languages and repository settings.

Problem

Research questions and friction points this paper is trying to address.

automated vulnerability repair

repository-level repair

multi-file repair

LLM-based agents

repair memory

Innovation

Methods, ideas, or system contributions that make the work stand out.

hierarchical memory

agentic vulnerability repair

iterative refinement