Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling

📅 2026-05-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

219K/year
🤖 AI Summary
This work addresses the high latency and tool misuse inherent in current agents that rely on repeated large language model (LLM) invocations for natural language tasks. It introduces, for the first time, just-in-time (JIT) compilation principles into web agent planning, proposing JIT-Planner and JIT-Scheduler. JIT-Planner directly compiles task descriptions into verified, low-cost executable code plans, while JIT-Scheduler optimizes parallel execution through Monte Carlo-based scheduling. To enhance correctness, the approach incorporates a formal tool protocol constrained by state invariants. Experimental results across five web applications demonstrate that JIT-Planner achieves a 10.4× speedup over Browser-Use with a 28% improvement in accuracy, and JIT-Scheduler yields a 2.4× speedup over OpenAI CUA with a 9% gain in accuracy.
📝 Abstract
Computer-use agents (CUA) automate tasks specified with natural language such as "order the cheapest item from Taco Bell" by generating sequences of calls to tools such as click, type, and scroll on a browser. Current implementations follow a sequential fetch-screenshot-execute loop where each iteration requires an LLM call, resulting in high latency and frequent errors from incorrect tool use. We present agent just-in-time (JIT) compilation, an alternative that compiles task descriptions directly into executable code that is free to include LLM calls, tool calls, and parallelization. Our approach comprises three components: (1) JIT-Planner, which generates multiple code plans, validates each against tool specifications, and selects the minimum-cost candidate; (2) JIT-Scheduler, which explores parallelization strategies via Monte Carlo cost estimation from learned latency distributions; and (3) an invariant-enforcing tool protocol specifying precondition and postcondition state requirements that reduce the rate of generating plans with incorrect tool use. Across 5 web applications, JIT-Planner achieves $10.4\times$ speedup and $+28\%$ accuracy over Browser-Use, while JIT-Scheduler achieves $2.4\times$ speedup and $+9\%$ accuracy over OpenAI CUA.
Problem

Research questions and friction points this paper is trying to address.

computer-use agents
latency
tool use errors
web automation
sequential execution
Innovation

Methods, ideas, or system contributions that make the work stand out.

just-in-time compilation
agent planning
parallelization
tool protocol
latency optimization
🔎 Similar Papers
2024-07-01arXiv.orgCitations: 85