🤖 AI Summary
This work investigates the capacity of diffusion policies to implicitly learn kinematic equality-constraint manifolds in robot imitation learning. Focusing on bimanual pick-and-place tasks, we introduce an evaluation benchmark explicitly designed to assess constraint satisfaction, combining simulation and real-robot experiments. We systematically analyze how dataset scale, quality, and constraint-manifold curvature affect implicit constraint learning. Results show that diffusion policies yield only coarse approximations of the true constraint manifold; constraint satisfaction is strongly influenced by data quality and quantity, but largely insensitive to manifold curvature; low-quality or small-scale datasets lead to substantial increases in constraint violation rates. To our knowledge, this is the first study to quantitatively characterize the limitations of diffusion policies in modeling high-dimensional kinematic constraints implicitly. Furthermore, we demonstrate that findings from simulation generalize to physical robots—providing critical empirical evidence for constraint learning in embodied intelligence based on generative models.
📝 Abstract
Diffusion policies have shown impressive results in robot imitation learning, even for tasks that require satisfaction of kinematic equality constraints. However, task performance alone is not a reliable indicator of the policy's ability to precisely learn constraints in the training data. To investigate, we analyze how well diffusion policies discover these manifolds with a case study on a bimanual pick-and-place task that encourages fulfillment of a kinematic constraint for success. We study how three factors affect trained policies: dataset size, dataset quality, and manifold curvature. Our experiments show diffusion policies learn a coarse approximation of the constraint manifold with learning affected negatively by decreases in both dataset size and quality. On the other hand, the curvature of the constraint manifold showed inconclusive correlations with both constraint satisfaction and task success. A hardware evaluation verifies the applicability of our results in the real world. Project website with additional results and visuals: https://diffusion-learns-kinematic.github.io