TxRay: Agentic Postmortem of Live Blockchain Attacks

πŸ“… 2026-02-01
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the inefficiency and incompleteness of manual postmortem analyses in DeFi attacks, which often suffer from limited on-chain evidence and risk missing critical details. To overcome this, we propose the first automated system that integrates large language model (LLM) agents with executable semantic assertions to reconstruct the full lifecycle of Anyone-Can-Take–style attacks from minimal transaction seeds. The system generates self-verifiable, hardcoding-free proof-of-concept (PoC) exploits and introduces PoCEvaluator, an independent evaluation framework to ensure output quality. Evaluated on 114 real-world incidents, our approach achieves a 92.11% end-to-end reproduction rate, with 98.1% of generated PoCs generalizing to new contract addresses. It localizes root causes in a median of 40 minutes and produces functional PoCs within 59 minutes.

Technology Category

Application Category

πŸ“ Abstract
Decentralized Finance (DeFi) has turned blockchains into financial infrastructure, allowing anyone to trade, lend, and build protocols without intermediaries, but this openness exposes pools of value controlled by code. Within five years, the DeFi ecosystem has lost over 15.75B USD to reported exploits. Many exploits arise from permissionless opportunities that any participant can trigger using only public state and standard interfaces, which we call Anyone-Can-Take (ACT) opportunities. Despite on-chain transparency, postmortem analysis remains slow and manual: investigations start from limited evidence, sometimes only a single transaction hash, and must reconstruct the exploit lifecycle by recovering related transactions, contract code, and state dependencies. We present TxRay, a Large Language Model (LLM) agentic postmortem system that uses tool calls to reconstruct live ACT attacks from limited evidence. Starting from one or more seed transactions, TxRay recovers the exploit lifecycle, derives an evidence-backed root cause, and generates a runnable, self-contained Proof of Concept (PoC) that deterministically reproduces the incident. TxRay self-checks postmortems by encoding incident-specific semantic oracles as executable assertions. To evaluate PoC correctness and quality, we develop PoCEvaluator, an independent agentic execution-and-review evaluator. On 114 incidents from DeFiHackLabs, TxRay produces an expert-aligned root cause and an executable PoC for 105 incidents, achieving 92.11% end-to-end reproduction. Under PoCEvaluator, 98.1% of TxRay PoCs avoid hard-coding attacker addresses, a +22.9pp lift over DeFiHackLabs. In a live deployment, TxRay delivers validated root causes in 40 minutes and PoCs in 59 minutes at median latency. TxRay's oracle-validated PoCs enable attack imitation, improving coverage by 15.6% and 65.5% over STING and APE.
Problem

Research questions and friction points this paper is trying to address.

DeFi
postmortem analysis
blockchain attacks
ACT opportunities
exploit reconstruction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Agentic Postmortem
Anyone-Can-Take (ACT) Exploits
LLM Tool Use
Executable Proof of Concept
Semantic Oracles
πŸ”Ž Similar Papers
No similar papers found.