Encoding Predictability and Legibility for Style-Conditioned Diffusion Policy

📅 2026-03-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of dynamically balancing action efficiency and readability in human-robot collaboration, where overly expressive motions become redundant in low-ambiguity scenarios. To this end, the authors propose Style-Conditioned Diffusion Policy (SCDP), a lightweight post-training framework built upon a frozen pre-trained diffusion policy. SCDP incorporates a scene encoder, a style-condition predictor, and an ambiguity-aware mechanism that adaptively modulates trajectory generation based on environmental ambiguity levels. This approach enables on-demand switching between efficient and legible motion styles without requiring retraining. Evaluated on manipulation and navigation tasks, SCDP significantly enhances motion readability in high-ambiguity settings while preserving near-optimal efficiency in low-ambiguity contexts, consistently outperforming fixed-style policies across diverse scenarios.

Technology Category

Application Category

📝 Abstract
Striking a balance between efficiency and transparent motion is a core challenge in human-robot collaboration, as highly expressive movements often incur unnecessary time and energy costs. In collaborative environments, legibility allows a human observer a better understanding of the robot's actions, increasing safety and trust. However, these behaviors result in sub-optimal and exaggerated trajectories that are redundant in low-ambiguity scenarios where the robot's goal is already obvious. To address this trade-off, we propose Style-Conditioned Diffusion Policy (SCDP), a modular framework that constrains the trajectory generation of a pre-trained diffusion model toward either legibility or efficiency based on the environment's configuration. Our method utilizes a post-training pipeline that freezes the base policy and trains a lightweight scene encoder and conditioning predictor to modulate the diffusion process. At inference time, an ambiguity detection module activates the appropriate conditioning, prioritizing expressive motion only for ambiguous goals and reverting to efficient paths otherwise. We evaluate SCDP on manipulation and navigation tasks, and results show that it enhances legibility in ambiguous settings while preserving optimal efficiency when legibility is unnecessary, all without retraining the base policy.
Problem

Research questions and friction points this paper is trying to address.

legibility
efficiency
human-robot collaboration
motion planning
ambiguity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Style-Conditioned Diffusion Policy
Legibility-Efficiency Trade-off
Post-Training Conditioning
Ambiguity-Aware Motion Generation
Diffusion Policy Modulation
🔎 Similar Papers
No similar papers found.
A
Adrien Jacquet Crétides
Institut des Systèmes Intelligents et de Robotique (ISIR), CNRS UMR7222, Inserm ERL U1150, Sorbonne Université, Paris, France
Mouad Abrini
Mouad Abrini
PhD Student, Sorbonne University
RoboticsHuman-Robot InteractionArtificial IntelligenceSocial Robotics
H
Hamed Rahimi
Institut des Systèmes Intelligents et de Robotique (ISIR), CNRS UMR7222, Inserm ERL U1150, Sorbonne Université, Paris, France
Mohamed Chetouani
Mohamed Chetouani
Professor, Sorbonne Universite, ISIR-UPMC, CNRS
social signal processinghuman-robot interactioninteractive machine learningmultimodal interaction