Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI

📅 2025-10-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current AI systems exhibit a fundamental disconnect between physics-aware perception and symbolic physical reasoning, hindering the deep integration of physical laws into artificial intelligence. To address this, we propose the first unified framework that bridges perception, reasoning, modeling, and interaction—thereby realizing a next-generation world model grounded in both physical priors and embodied causal inference. Our approach formally unifies *theoretical physics reasoning* (axiomatic, formal deduction) with *practical physics understanding* (causal modeling grounded in embodied interaction), systematically integrating symbolic reasoning, generative modeling, multimodal perception, and first-principles physics modeling. The resulting structured physics-AI knowledge system is released as an open-source resource library. Empirical evaluation demonstrates substantial improvements in model interpretability, cross-scenario generalizability, and physical consistency—establishing foundational progress toward safe, generalizable, and verifiable physics-integrated intelligence.

Technology Category

Application Category

📝 Abstract
The rapid advancement of embodied intelligence and world models has intensified efforts to integrate physical laws into AI systems, yet physical perception and symbolic physics reasoning have developed along separate trajectories without a unified bridging framework. This work provides a comprehensive overview of physical AI, establishing clear distinctions between theoretical physics reasoning and applied physical understanding while systematically examining how physics-grounded methods enhance AI's real-world comprehension across structured symbolic reasoning, embodied systems, and generative models. Through rigorous analysis of recent advances, we advocate for intelligent systems that ground learning in both physical principles and embodied reasoning processes, transcending pattern recognition toward genuine understanding of physical laws. Our synthesis envisions next-generation world models capable of explaining physical phenomena and predicting future states, advancing safe, generalizable, and interpretable AI systems. We maintain a continuously updated resource at https://github.com/AI4Phys/Awesome-AI-for-Physics.
Problem

Research questions and friction points this paper is trying to address.

Bridging physical perception and symbolic reasoning in AI
Integrating physical laws into embodied intelligence systems
Developing unified frameworks for physics-grounded world models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrating physical laws into AI systems
Bridging physical perception and symbolic reasoning
Developing physics-grounded world models for prediction
🔎 Similar Papers
No similar papers found.
Kun Xiang
Kun Xiang
School of Intelligent Systems Engineering, Sun Yat-Sen University
Symbolic ReasoningMedical Imaging Process
Terry Jingchen Zhang
Terry Jingchen Zhang
ETH Zurich
(Multimodal) ReasoningAI SafetyActionable InterpretabilityAI4ScienceAstrophysics
Yinya Huang
Yinya Huang
Postdoc Fellow at ETH AI Center, ETH Zürich; Prev. CityU Hong Kong, SYSU
AI for MathAI for ScienceReliable Machine LearningLLMsNLP
J
Jixi He
Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
Z
Zirong Liu
Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
Y
Yueling Tang
Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
Ruizhe Zhou
Ruizhe Zhou
Northwestern Polytechnical University
UAV communicationsNavigation and positioning
L
Lijing Luo
Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
Y
Youpeng Wen
Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
X
Xiuwei Chen
Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
Bingqian Lin
Bingqian Lin
Shanghai Jiao Tong University | Sun Yat-sen University
Computer VisionMultimodalEmbodied AI
Jianhua Han
Jianhua Han
2030 Research, YinWang, Huawei
Vision Language ModelFoundation ModelVLA
H
Hang Xu
Yinwang Intelligent Technology Co., Ltd. Shenzhen, China
Hanhui Li
Hanhui Li
Sun Yat-sen University
Deep LearningComputer Vision
B
Bin Dong
Peking University and Beijing International Center for Mathematical Research, Beijing, China
Xiaodan Liang
Xiaodan Liang
Professor of Computer Science, Sun Yat-sen University, MBZUAI, CMU, NUS
Computer visionEmbodied AIMachine learning