Learning API Functionality from Demonstrations for Tool-based Agents

📅 2025-05-30

📈 Citations: 0

✨ Influential: 0

career value

198K/year

🤖 AI Summary

API documentation is often missing, outdated, or unreliable, leading to frequent failures in tool-using agents’ API invocations. Method: This paper proposes a documentation-free paradigm for learning API functionality, decoupling API understanding from documentation for the first time. It integrates expert demonstrations with autonomous exploration and introduces explicit function calling alongside natural-language critique feedback to jointly optimize parameter instantiation and invocation logic. Contribution/Results: Extensive experiments across five large language models and three API benchmarks demonstrate that, under documentation-free settings, our method improves task success rate by 27.4% on average. Moreover, it identifies critical failure modes and clarifies core challenges in self-improving API agents. This work establishes a scalable, robust methodology for building general-purpose, tool-augmented intelligent agents.

Technology Category

Application Category

📝 Abstract

Digital tool-based agents that invoke external Application Programming Interfaces (APIs) often rely on documentation to understand API functionality. However, such documentation is frequently missing, outdated, privatized, or inconsistent-hindering the development of reliable, general-purpose agents. In this work, we propose learning API functionality directly from demonstrations as a new paradigm applicable in scenarios without documentation. Using existing API benchmarks, we collect demonstrations from both expert API-based agents and from self-exploration. To understand what information demonstrations must convey for successful task completion, we extensively study how the number of demonstrations and the use of LLM-generated summaries and evaluations affect the task success rate of the API-based agent. Our experiments across 3 datasets and 5 models show that learning functionality from demonstrations remains a non-trivial challenge, even for state-of-the-art LLMs. We find that providing explicit function calls and natural language critiques significantly improves the agent's task success rate due to more accurate parameter filling. We analyze failure modes, identify sources of error, and highlight key open challenges for future work in documentation-free, self-improving, API-based agents.

Problem

Research questions and friction points this paper is trying to address.

Learning API functionality without relying on documentation

Improving task success rates through demonstrations and critiques

Addressing challenges in documentation-free API-based agents

Innovation

Methods, ideas, or system contributions that make the work stand out.

Learning API functionality from demonstrations

Using expert and self-exploration demonstrations

Improving success with function calls and critiques

🔎 Similar Papers

ToolACE: Winning the Points of LLM Function Calling