CostNav: A Navigation Benchmark for Cost-Aware Evaluation of Embodied Agents

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing navigation benchmarks overemphasize task success rate while neglecting economic feasibility—a critical bottleneck for commercial deployment of autonomous delivery robots. Method: We propose CostNav, the first micro-navigation economic evaluation framework for embodied agents, integrating hardware cost, energy consumption, collision-related maintenance expenses, and delivery revenue into a quantifiable cost–benefit model. Contribution/Results: We empirically reveal a significant decoupling between navigation success rate and economic profitability: under a 43.0% SLA compliance rate, baseline methods incur an average loss of $30.01 per delivery, with 99.7% of costs attributable to collision maintenance and no breakeven point. Leveraging industry-driven modeling and simulation-to-reality mapping, we unify rule-based control, imitation learning, and cost-aware reinforcement learning to establish economic efficiency as a primary optimization objective for navigation algorithm design.

Technology Category

Application Category

📝 Abstract
Existing navigation benchmarks focus on task success metrics while overlooking economic viability -- critical for commercial deployment of autonomous delivery robots. We introduce emph{CostNav}, a extbf{Micro-Navigation Economic Testbed} that evaluates embodied agents through comprehensive cost-revenue analysis aligned with real-world business operations. CostNav models the complete economic lifecycle including hardware, training, energy, maintenance costs, and delivery revenue with service-level agreements, using industry-derived parameters. extbf{To our knowledge, CostNav is the first work to quantitatively expose the gap between navigation research metrics and commercial viability}, revealing that optimizing for task success fundamentally differs from optimizing for economic deployment. Our cost model uses parameters derived from industry data sources (energy rates, delivery service pricing), and we project from a reduced-scale simulation to realistic deliveries. Under this projection, the baseline achieves 43.0% SLA compliance but is emph{not} commercially viable: yielding a loss of $30.009 per run with no finite break-even point, because operating costs are dominated by collision-induced maintenance, which accounts for 99.7% of per-run costs and highlights collision avoidance as a key optimization target. We demonstrate a learning-based on-device navigation baseline and establish a foundation for evaluating rule-based navigation, imitation learning, and cost-aware RL training. CostNav bridges the gap between navigation research and commercial deployment, enabling data-driven decisions about economic trade-offs across navigation paradigms.
Problem

Research questions and friction points this paper is trying to address.

Existing navigation benchmarks overlook economic viability for commercial robot deployment
Current navigation metrics differ fundamentally from optimizing for economic deployment success
Collision-induced maintenance dominates operating costs making baseline systems unprofitable
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces CostNav economic testbed for cost-revenue analysis
Models complete economic lifecycle using industry-derived parameters
Quantitatively exposes gap between navigation metrics and commercial viability
🔎 Similar Papers
No similar papers found.
Haebin Seong
Haebin Seong
KAIST
Large Language ModelsSafetyReinforcement LearningRobotics Foundational Models
S
Sungmin Kim
MAUM.AI
M
Minchan Kim
MAUM.AI
Yongjun Cho
Yongjun Cho
MaumAI
Artificial Intelligence
Myunchul Joe
Myunchul Joe
MAUM.AI, Inc. CAIO
S
Suhwan Choi
MAUM.AI
Jaeyoon Jung
Jaeyoon Jung
MAUM AI Inc, Soongsil University, Republic of Korea
MultimodalEmbodied AI
J
Jiyong Youn
MAUM.AI
Y
Yoonshik Kim
MAUM.AI
S
Samwoo Seong
MAUM.AI
Y
Yubeen Park
MAUM.AI
Y
Youngjae Yu
Seoul National University
Y
Yunsung Lee
MAUM.AI