Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses

📅 2026-03-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Embodied intelligence faces significant challenges in open, safety-critical environments due to perceptual uncertainty, incomplete knowledge, and dynamic human–agent interactions, which can lead to physical risks. This work systematically reviews security threats and defense mechanisms across the full-stack pipeline—spanning perception, cognition, planning, action, and interaction—and proposes the first unified, multi-layered safety taxonomy that integrates research on adversarial attacks, backdoors, jailbreaking, and hardware-level exploits. Drawing on an analysis of over 500 studies, the paper identifies critical vulnerabilities such as fragility in multimodal perception fusion, instability in planning, and a lack of mutual trust in human–agent collaboration. It further highlights key research gaps and provides a systematic roadmap toward developing safe, reliable, and deployable embodied agents.
📝 Abstract
Embodied Artificial Intelligence (Embodied AI) integrates perception, cognition, planning, and interaction into agents that operate in open-world, safety-critical environments. As these systems gain autonomy and enter domains such as transportation, healthcare, and industrial or assistive robotics, ensuring their safety becomes both technically challenging and socially indispensable. Unlike digital AI systems, embodied agents must act under uncertain sensing, incomplete knowledge, and dynamic human-robot interactions, where failures can directly lead to physical harm. This survey provides a comprehensive and structured review of safety research in embodied AI, examining attacks and defenses across the full embodied pipeline, from perception and cognition to planning, action and interaction, and agentic system. We introduce a multi-level taxonomy that unifies fragmented lines of work and connects embodied-specific safety findings with broader advances in vision, language, and multimodal foundation models. Our review synthesizes insights from over 400 papers spanning adversarial, backdoor, jailbreak, and hardware-level attacks; attack detection, safe training and robust inference; and risk-aware human-agent interaction. This analysis reveals several overlooked challenges, including the fragility of multimodal perception fusion, the instability of planning under jailbreak attacks, and the trustworthiness of human-agent interaction in open-ended scenarios. By organizing the field into a coherent framework and identifying critical research gaps, this survey provides a roadmap for building embodied agents that are not only capable and autonomous but also safe, robust, and reliable in real-world deployment.
Problem

Research questions and friction points this paper is trying to address.

Embodied AI
Safety
Risk
Attack
Human-Agent Interaction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Embodied AI
Safety Taxonomy
Multimodal Perception
Adversarial Robustness
Human-Agent Interaction
X
Xiao Li
Fudan University
Xiang Zheng
Xiang Zheng
Department of Computer Science, City University of Hong Kong
Reinforcement LearningTrustworthy AIEmbodied AI
Y
Yifeng Gao
Fudan University
X
Xinyu Xia
Jilin University
Y
Yixu Wang
Fudan University
Xin Wang
Xin Wang
Fudan University
Computer VisionTrustworthy ML
Y
Ye Sun
Fudan University
Yunhan Zhao
Yunhan Zhao
Fudan University
AI Safety
M
Ming Wen
Fudan University, Shanghai Innovation Institute
J
Jiayu Li
Fudan University
Z
Zixing Chen
Fudan University
Xun Gong
Xun Gong
Associate Professor, Jilin University
Connected Vehicle ControlDriver Modeling
Yi Liu
Yi Liu
Department of Computer Science, City University of Hong Kong
Security and PrivacyFederated LearningAI Security
Yige Li
Yige Li
Singapore Management University
Trustworthy Machine Learning
Y
Yutao Wu
Deakin University
Cong Wang
Cong Wang
Department of Computer Science, City University of Hong Kong
cloudsecuritybig datacomputation outsourcingaccess control
J
Jun Sun
Singapore Management University
Yixin Cao
Yixin Cao
Fudan University
Natural Language ProcessingKnowledge EngineeringMulti-modal data processing
Zhineng Chen
Zhineng Chen
Institute of Trustworthy Embodied AI, Fudan University
Computer VisionOCRMultimedia Analysis
Jingjing Chen
Jingjing Chen
Fudan University
MultimediaComputer VisionMachine LearningPattern recognition
T
Tao Gui
Fudan University, Shanghai Innovation Institute
Qi Zhang
Qi Zhang
Fudan University
SAGINsatellite routing
Zuxuan Wu
Zuxuan Wu
Fudan University
X
Xipeng Qiu
Fudan University, Shanghai Innovation Institute
X
Xuanjing Huang
Fudan University