🤖 AI Summary
Addressing socially aware navigation in human-robot cohabited public spaces, this paper proposes the first synergistic distributional coupling framework integrating port-Hamiltonian dynamical modeling with conditional diffusion models. Methodologically, it introduces port-Hamiltonian systems—ensuring physically interpretable and energy-consistent interaction dynamics—to social navigation for the first time; couples diffusion models to explicitly capture human behavioral uncertainty; and jointly optimizes social compliance and interpretability via spatiotemporal Transformers and reinforcement learning from human feedback (RLHF). Evaluated across multiple real-world benchmark scenarios, our approach significantly outperforms state-of-the-art methods: improving navigation stability by 23.6%, reducing collision rate by 41.2%, and increasing human comfort scores by 35.8%. The core contribution lies in establishing a unified navigation paradigm that simultaneously guarantees physical consistency, enables socially grounded intention inference, and adapts to human preferences.
📝 Abstract
Navigating in human-filled public spaces is a critical challenge for deploying autonomous robots in real-world environments. This paper introduces NaviDIFF, a novel Hamiltonian-constrained socially-aware navigation framework designed to address the complexities of human-robot interaction and socially-aware path planning. NaviDIFF integrates a port-Hamiltonian framework to model dynamic physical interactions and a diffusion model to manage uncertainty in human-robot cooperation. The framework leverages a spatial-temporal transformer to capture social and temporal dependencies, enabling more accurate spatial-temporal environmental dynamics understanding and port-Hamiltonian physical interactive process construction. Additionally, reinforcement learning from human feedback is employed to fine-tune robot policies, ensuring adaptation to human preferences and social norms. Extensive experiments demonstrate that NaviDIFF outperforms state-of-the-art methods in social navigation tasks, offering improved stability, efficiency, and adaptability.