Data-Asymmetric Latent Imagination and Reranking for 3D Robotic Imitation Learning

📅 2026-05-11

📈 Citations: 0

✨ Influential: 0

career value

190K/year

🤖 AI Summary

This work addresses the challenge in real-world robotic imitation learning where high-quality demonstrations are scarce and abundant suboptimal or failed trajectories remain underutilized. To tackle this, the authors propose DALI-R, a novel framework that, for the first time in 3D imitation learning, integrates asymmetric data utilization, imagination replay via a 3D point cloud–based latent world model, and a task-completion scorer to re-rank action segments. This approach effectively extracts informative environmental dynamics and failure patterns from low-quality trajectories. By combining diffusion policies, flow matching, and action-chunk re-ranking, DALI-R achieves an average improvement of 6.8% in task success rate on the Adroit and MetaWorld benchmarks, with less than a 0.7× increase in inference overhead.

📝 Abstract

Robotic imitation learning typically assumes access to optimal demonstrations, yet real-world data collection often yields suboptimal, exploratory, or even failed trajectories. Discarding such data wastes valuable information about environment dynamics and failure modes, which can instead be leveraged to improve decision-making. While 3D policies reduce reliance on high-quality demonstrations through strong spatial generalization, they still require large-scale data to achieve high task success. To address this, we propose DALI-R, a Data-Asymmetric Latent Imagination and Reranking framework for 3D robotic imitation learning from mixed-quality trajectories. It learns a Latent World Model over 3D point clouds for imagined rollouts and a Task Completion Scorer that reranks candidate action chunks, improving decision-making without additional high-quality demonstrations. We instantiate DALI-R with both diffusion and efficient flow-matching policies and evaluate it on Adroit and MetaWorld benchmarks. Across the two evaluated 3D base policies, DALI-R achieves an average $6.8$\% improvement in success rate while incurring less than $0.7\times$ additional inference overhead.

Problem

Research questions and friction points this paper is trying to address.

Imitation Learning

3D Robotics

Suboptimal Demonstrations

Mixed-quality Trajectories

Data Efficiency

Innovation

Methods, ideas, or system contributions that make the work stand out.

Latent World Model

Imitation Learning

3D Point Clouds

Reranking

Flow Matching

🔎 Similar Papers

No similar papers found.

Bosch Group

Renningen, BW, DE

Research Scientist Intern, Robotic Control Policy (PhD)