🤖 AI Summary
Collaborative robots must adapt to new tasks and user preferences in sustained multi-task settings while minimizing human cognitive and physical burden.
Method: This paper proposes the Cost-Optimal Interactive Learning (COIL) framework, which formulates multi-task human–robot interaction planning as an uncapacitated facility location (UFL) problem—integrated with belief-space planning to robustly handle preference uncertainty. COIL jointly schedules three interaction types: skill queries, preference queries, and human assistance requests.
Contribution/Results: COIL is the first to cast interactive multi-task adaptation as UFL, enabling polynomial-time computation of bounded-suboptimal solutions. It guarantees high task success rates while drastically reducing human involvement. In simulation and real-world robotic arm experiments, COIL reduces total interaction count by 42% compared to baseline methods, while maintaining a task completion rate of ≥98.7%.
📝 Abstract
Collaborative robots must continually adapt to novel tasks and user preferences without overburdening the user. While prior interactive robot learning methods aim to reduce human effort, they are typically limited to single-task scenarios and are not well-suited for sustained, multi-task collaboration. We propose COIL (Cost-Optimal Interactive Learning) -- a multi-task interaction planner that minimizes human effort across a sequence of tasks by strategically selecting among three query types (skill, preference, and help). When user preferences are known, we formulate COIL as an uncapacitated facility location (UFL) problem, which enables bounded-suboptimal planning in polynomial time using off-the-shelf approximation algorithms. We extend our formulation to handle uncertainty in user preferences by incorporating one-step belief space planning, which uses these approximation algorithms as subroutines to maintain polynomial-time performance. Simulated and physical experiments on manipulation tasks show that our framework significantly reduces the amount of work allocated to the human while maintaining successful task completion.