An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation

📅 2025-03-13

📈 Citations: 0

✨ Influential: 0

career value

216K/year

🤖 AI Summary

Model mismatch in sim-to-real transfer severely hinders deployment of robotic policies trained in simulation. Method: This paper proposes the Real-Sim-Real (RSR) closed-loop framework, which jointly optimizes policy and simulator parameters via gradient-based online system identification using differentiable simulation (MuJoCo MJX), dynamically aligning simulated dynamics with real-world behavior. An information-theoretic cost function guides active, diverse real-world data collection to maximize parameter identifiability. Unlike conventional unidirectional sim-to-real pipelines, RSR establishes an iterative “real → sim → real” optimization loop, tightly coupling reinforcement learning (PPO/SAC) with online system identification. Contribution/Results: RSR significantly reduces the sim-to-real performance gap across diverse manipulation tasks. It demonstrates strong robustness to both explicit (e.g., mass, friction) and implicit (e.g., unmodeled contact dynamics) environmental uncertainties, and exhibits superior cross-scenario generalization without task-specific retraining.

Technology Category

Application Category

📝 Abstract

The sim-to-real gap remains a critical challenge in robotics, hindering the deployment of algorithms trained in simulation to real-world systems. This paper introduces a novel Real-Sim-Real (RSR) loop framework leveraging differentiable simulation to address this gap by iteratively refining simulation parameters, aligning them with real-world conditions, and enabling robust and efficient policy transfer. A key contribution of our work is the design of an informative cost function that encourages the collection of diverse and representative real-world data, minimizing bias and maximizing the utility of each data point for simulation refinement. This cost function integrates seamlessly into existing reinforcement learning algorithms (e.g., PPO, SAC) and ensures a balanced exploration of critical regions in the real domain. Furthermore, our approach is implemented on the versatile Mujoco MJX platform, and our framework is compatible with a wide range of robotic systems. Experimental results on several robotic manipulation tasks demonstrate that our method significantly reduces the sim-to-real gap, achieving high task performance and generalizability across diverse scenarios of both explicit and implicit environmental uncertainties.

Problem

Research questions and friction points this paper is trying to address.

Addresses the sim-to-real gap in robotics using a Real-Sim-Real loop framework.

Introduces a cost function for diverse real-world data collection to refine simulations.

Enhances policy transfer efficiency and generalizability across robotic manipulation tasks.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Real-Sim-Real loop with differentiable simulation

Informative cost function for diverse data collection

Integration with Mujoco MJX and reinforcement learning

🔎 Similar Papers

Autonomous Vehicle Controllers From End-to-End Differentiable Simulation

2024-09-12arXiv.orgCitations: 0

Bosch Group

Renningen, BW, DE

Robotics Autonomy Engineer-Planning and Control

Field AI

Irvine, CA

Research Scientist Intern, Robotic Control Policy (PhD)