DuetUI: A Bidirectional Context Loop for Human-Agent Co-Generation of Task-Oriented Interfaces

📅 2025-09-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) struggle to accurately capture dynamically evolving human intent and enable fine-grained interactive control in complex, multi-step tasks. To address this, we propose a human-AI collaborative generation paradigm, realized through a task-oriented interface system grounded in a bidirectional context loop. In this framework, user actions—such as clicks or edits—implicitly update the contextual state, prompting the LLM to dynamically decompose the task and generate context-adapted interface elements; conversely, model outputs are rendered in real time within the interface, enabling immediate intent refinement and user intervention. This mechanism uniquely integrates user interaction behavior into the generative closed loop, unifying task decomposition, context-aware modeling, and interaction trajectory tracking. User studies demonstrate a 37.2% improvement in task completion efficiency over baseline methods and significant gains in interface usability, empirically validating the efficacy of the bidirectional context loop for enhancing human-AI collaboration.

Technology Category

Application Category

📝 Abstract
Large Language Models are reshaping task automation, yet remain limited in complex, multi-step real-world tasks that require aligning with vague user intent and enabling dynamic user override. From a formative study with 12 participants, we found that end-users actively seek to shape generative interfaces rather than relying on one-shot outputs. To address this, we introduce the human-agent co-generation paradigm, materialized in DuetUI. This LLM-empowered system unfolds alongside task progress through a bidirectional context loop--the agent scaffolds the interface by decomposing the task, while the user's direct manipulations implicitly steer the agent's next generation step. In a user study with 24 participants, DuetUI significantly improved task efficiency and interface usability compared to a baseline, fostering seamless human-agent collaboration. Our contributions include the proposal and validation of this novel paradigm, the design of the DuetUI prototype embodying it, and empirical insights into how this bidirectional loop better aligns agents with human intent.
Problem

Research questions and friction points this paper is trying to address.

Addressing limitations in complex multi-step task automation
Aligning vague user intent with dynamic override capability
Enabling human-agent co-generation through bidirectional context interaction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bidirectional context loop for human-agent collaboration
LLM-empowered system with task decomposition scaffolding
User manipulation steers agent's interface generation
🔎 Similar Papers
No similar papers found.
Y
Yuan Xu
The Hong Kong University of Science and Technology (Guangzhou), China
S
Shaowen Xiang
Shanghai Jiao Tong University, China and The Hong Kong University of Science and Technology (Guangzhou), China
Yizhi Song
Yizhi Song
Research Scientist, Bytedance / Tiktok
Image generationGenerative AIMLLMDiffusion
R
Ruoting Sun
University of California, Berkeley, United States and The Hong Kong University of Science and Technology (Guangzhou), China
X
Xin Tong
The Hong Kong University of Science and Technology (Guangzhou), China