ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions

📅 2025-05-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current LLM agents exhibit limited proactive service capability in open-world environments, constrained by closed-loop perception and rule-based reasoning, leading to poor user intent understanding. This paper introduces the first context-aware, proactive LLM agent endowed with open-world sensory perception: it continuously acquires multimodal sensory data (e.g., video, audio) via wearable devices and jointly models cross-modal user intent with historical behavioral profiles to enable demand prediction and seamless tool invocation. We propose ContextAgentBench—the first benchmark tailored for proactive LLM agents—and a novel dual-path context-driven decision framework integrating sensory input and historical context. Experiments demonstrate that our approach achieves an 8.5% improvement in proactive prediction accuracy and a 6.0% gain in tool invocation accuracy on ContextAgentBench, covering nine daily-life scenarios and twenty distinct tools.

Technology Category

Application Category

📝 Abstract
Recent advances in Large Language Models (LLMs) have propelled intelligent agents from reactive responses to proactive support. While promising, existing proactive agents either rely exclusively on observations from enclosed environments (e.g., desktop UIs) with direct LLM inference or employ rule-based proactive notifications, leading to suboptimal user intent understanding and limited functionality for proactive service. In this paper, we introduce ContextAgent, the first context-aware proactive agent that incorporates extensive sensory contexts to enhance the proactive capabilities of LLM agents. ContextAgent first extracts multi-dimensional contexts from massive sensory perceptions on wearables (e.g., video and audio) to understand user intentions. ContextAgent then leverages the sensory contexts and the persona contexts from historical data to predict the necessity for proactive services. When proactive assistance is needed, ContextAgent further automatically calls the necessary tools to assist users unobtrusively. To evaluate this new task, we curate ContextAgentBench, the first benchmark for evaluating context-aware proactive LLM agents, covering 1,000 samples across nine daily scenarios and twenty tools. Experiments on ContextAgentBench show that ContextAgent outperforms baselines by achieving up to 8.5% and 6.0% higher accuracy in proactive predictions and tool calling, respectively. We hope our research can inspire the development of more advanced, human-centric, proactive AI assistants.
Problem

Research questions and friction points this paper is trying to address.

Enhancing proactive LLM agents with sensory context awareness
Improving user intent understanding via multi-dimensional sensory data
Automating tool calls for unobtrusive proactive assistance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses multi-dimensional sensory data for intent understanding
Combines sensory and historical data for proactive predictions
Automatically calls tools for unobtrusive user assistance
🔎 Similar Papers
No similar papers found.
B
Bufang Yang
The Chinese University of Hong Kong
L
Lilin Xu
Columbia University
L
Liekang Zeng
The Chinese University of Hong Kong
K
Kaiwei Liu
The Chinese University of Hong Kong
Siyang Jiang
Siyang Jiang
The Chinese University of Hong Kong
Foundation ModelsFederated LearningFew-Shot LearningAIoT
W
Wenrui Lu
The Chinese University of Hong Kong
H
Hongkai Chen
The Chinese University of Hong Kong
Xiaofan Jiang
Xiaofan Jiang
Associate Professor of Electrical Engineering, Columbia University
Mobile and Embedded SystemsArtificial Intelligence of ThingsSmart Health and FitnessCPHS
Guoliang Xing
Guoliang Xing
The Chinese University of Hong Kong
Embedded AIAI for HealthAutonomous DrivingCyber-Physical SystemsWireless Networks
Z
Zhenyu Yan
The Chinese University of Hong Kong