Embodied Foundation Models at the Edge: A Survey of Deployment Constraints and Mitigation Strategies

📅 2026-03-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the deployment of embodied foundation models on edge devices, which is constrained by eight interrelated factors: model size, power consumption, memory bandwidth, computational latency, timing jitter, safety margins, and others. To tackle this challenge, the paper introduces the “Deployment Gauntlet” framework, which systematically structures these constraints as eight coupled system-level barriers. It further uncovers a key distinction between autoregressive and diffusion models: the former are primarily bottlenecked by memory bandwidth, while the latter suffer from computational latency and sustained execution overhead. By co-designing model architecture decomposition, memory optimization, real-time scheduling, and communication mechanisms—and by separating fast control loops from slower semantic reasoning pathways—the proposed approach delivers an efficient and reliable deployment solution for edge-based embodied intelligence.

Technology Category

Application Category

📝 Abstract
Deploying foundation models in embodied edge systems is fundamentally a systems problem, not just a problem of model compression. Real-time control must operate within strict size, weight, and power constraints, where memory traffic, compute latency, timing variability, and safety margins interact directly. The Deployment Gauntlet organizes these constraints into eight coupled barriers that determine whether embodied foundation models can run reliably in practice. Across representative edge workloads, autoregressive Vision-Language-Action policies are constrained primarily by memory bandwidth, whereas diffusion-based controllers are limited more by compute latency and sustained execution cost. Reliable deployment therefore depends on system-level co-design across memory, scheduling, communication, and model architecture, including decompositions that separate fast control from slower semantic reasoning.
Problem

Research questions and friction points this paper is trying to address.

Embodied Foundation Models
Edge Deployment
System Constraints
Real-time Control
Memory Bandwidth
Innovation

Methods, ideas, or system contributions that make the work stand out.

Embodied Foundation Models
Edge Deployment
Deployment Gauntlet
System Co-design
Memory-Compute Trade-offs
🔎 Similar Papers
No similar papers found.
U
Utkarsh Grover
University of South Florida, USA
R
Ravi Ranjan
Florida International University, USA
M
Mingyang Mao
University of South Florida, USA
T
Trung Tien Dong
University of South Florida, USA
S
Satvik Praveen
University of South Florida, USA
Z
Zhenqi Wu
University of South Florida, USA
J
J. Morris Chang
University of South Florida, USA
Tinoosh Mohsenin
Tinoosh Mohsenin
Johns Hopkins University
Energy efficient computing for autonomous systemsmachine learning and digital signal processing
Yi Sheng
Yi Sheng
University of South Florida
Machine learning hardware&software co-design
A
Agoritsa Polyzou
Florida International University, USA
Eiman Kanjo
Eiman Kanjo
Professor, Imperial College London
TinyMLEdge AIDecentralised AICollaborative & Distribuited AIPervasive Computing
Xiaomin Lin
Xiaomin Lin
Assistant Prof, University of South Florida
AI for goodRobotics for scienceRobotics for good