Mixed-Reality Digital Twins: Leveraging the Physical and Virtual Worlds for Hybrid Sim2Real Transition of Multi-Agent Reinforcement Learning Policies

📅 2024-03-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of prohibitively long training times, high real-world deployment costs, and poor sim-to-real transfer performance in multi-agent reinforcement learning (MARL) for cyber-physical vehicular systems, this paper proposes a hybrid-reality digital twin framework. The framework introduces an on-demand dynamic parallelization-based load scheduling mechanism to enable elastic scaling of simulation resources, coupled with a systematic domain randomization strategy to enhance policy generalization. Our approach enables efficient collaborative MARL training and zero-shot sim-to-real transfer. Extensive evaluation across cooperative and adversarial traffic scenarios demonstrates its effectiveness: training time is reduced by up to 76.3%, while the sim-to-real performance gap is narrowed to just 2.9%. The method thus achieves a compelling balance between computational scalability and physical deployability.

Technology Category

Application Category

📝 Abstract
Multi-agent reinforcement learning (MARL) for cyber-physical vehicle systems usually requires a significantly long training time due to their inherent complexity. Furthermore, deploying the trained policies in the real world demands a feature-rich environment along with multiple physical embodied agents, which may not be feasible due to monetary, physical, energy, or safety constraints. This work seeks to address these pain points by presenting a mixed-reality digital twin framework capable of: (i) selectively scaling parallelized workloads on-demand, and (ii) evaluating the trained policies across simulation-to-reality (sim2real) experiments. The viability and performance of the proposed framework are highlighted through two representative use cases, which cover cooperative as well as competitive classes of MARL problems. We study the effect of: (i) agent and environment parallelization on training time, and (ii) systematic domain randomization on zero-shot sim2real transfer across both case studies. Results indicate up to 76.3% reduction in training time with the proposed parallelization scheme and sim2real gap as low as 2.9% using the proposed deployment method.
Problem

Research questions and friction points this paper is trying to address.

Reduces training time for multi-agent reinforcement learning in cyber-physical systems.
Enables scalable and cost-effective deployment of trained policies in real-world environments.
Improves simulation-to-reality transfer accuracy using domain randomization techniques.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mixed-reality digital twin framework for MARL
Selective scaling of parallelized workloads
Systematic domain randomization for sim2real transfer
🔎 Similar Papers
No similar papers found.
C
C. Samak
Department of Automotive Engineering, Clemson University International Center for Automotive Research (CU-ICAR), Greenville, SC 29607, USA
Tanmay Vilas Samak
Tanmay Vilas Samak
PhD Candidate at CU-ICAR
RoboticsAutonomous SystemsDigital TwinsReal2SimSim2Real
V
V. Krovi
Department of Automotive Engineering, Clemson University International Center for Automotive Research (CU-ICAR), Greenville, SC 29607, USA