🤖 AI Summary
This paper addresses the weak generalization and poor adaptability of embodied intelligence in sim-to-real transfer. We propose a unified framework that synergistically integrates high-fidelity physics simulators—with their accurate modeling of external environmental dynamics—and world models—which learn compact, predictive internal representations. Our method jointly incorporates predictive coding, representation learning, and reinforcement learning to enable predictive planning and adaptive decision-making within the perception–reasoning–action loop. Key contributions include: (1) a systematic characterization of the complementary synergy between physics simulators and world models, establishing a novel paradigm bridging simulation-based training and real-world deployment; (2) an open-source, continuously updated literature repository comprehensively surveying technical approaches, state-of-the-art advances, and core challenges; and (3) a scalable theoretical and practical framework to enhance autonomy, cross-task generalization, and environmental adaptability of embodied AI systems.
📝 Abstract
The pursuit of artificial general intelligence (AGI) has placed embodied intelligence at the forefront of robotics research. Embodied intelligence focuses on agents capable of perceiving, reasoning, and acting within the physical world. Achieving robust embodied intelligence requires not only advanced perception and control, but also the ability to ground abstract cognition in real-world interactions. Two foundational technologies, physical simulators and world models, have emerged as critical enablers in this quest. Physical simulators provide controlled, high-fidelity environments for training and evaluating robotic agents, allowing safe and efficient development of complex behaviors. In contrast, world models empower robots with internal representations of their surroundings, enabling predictive planning and adaptive decision-making beyond direct sensory input. This survey systematically reviews recent advances in learning embodied AI through the integration of physical simulators and world models. We analyze their complementary roles in enhancing autonomy, adaptability, and generalization in intelligent robots, and discuss the interplay between external simulation and internal modeling in bridging the gap between simulated training and real-world deployment. By synthesizing current progress and identifying open challenges, this survey aims to provide a comprehensive perspective on the path toward more capable and generalizable embodied AI systems. We also maintain an active repository that contains up-to-date literature and open-source projects at https://github.com/NJU3DV-LoongGroup/Embodied-World-Models-Survey.