DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling

📅 2025-03-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Autonomous driving trajectory generation faces the “impossible triangle” trade-off among accuracy, computational efficiency, and memory overhead. To address this, we propose Directional Rotational Position Encoding (DRoPE), the first method to align RoPE’s 2D rotational encoding with agent heading—enabling zero-overhead modeling of relative angular information via an identity scalar correction. We theoretically prove DRoPE achieves O(n) time complexity while introducing no additional memory footprint. Extensive evaluation across multiple state-of-the-art models demonstrates: (i) maintained or improved trajectory prediction accuracy; (ii) up to 47% reduction in GPU memory consumption; and (iii) inference latency scaling linearly with sequence length. Our approach bridges theoretical rigor and engineering practicality, establishing a new paradigm for efficient end-to-end trajectory prediction.

Technology Category

Application Category

📝 Abstract
Accurate and efficient modeling of agent interactions is essential for trajectory generation, the core of autonomous driving systems. Existing methods, scene-centric, agent-centric, and query-centric frameworks, each present distinct advantages and drawbacks, creating an impossible triangle among accuracy, computational time, and memory efficiency. To break this limitation, we propose Directional Rotary Position Embedding (DRoPE), a novel adaptation of Rotary Position Embedding (RoPE), originally developed in natural language processing. Unlike traditional relative position embedding (RPE), which introduces significant space complexity, RoPE efficiently encodes relative positions without explicitly increasing complexity but faces inherent limitations in handling angular information due to periodicity. DRoPE overcomes this limitation by introducing a uniform identity scalar into RoPE's 2D rotary transformation, aligning rotation angles with realistic agent headings to naturally encode relative angular information. We theoretically analyze DRoPE's correctness and efficiency, demonstrating its capability to simultaneously optimize trajectory generation accuracy, time complexity, and space complexity. Empirical evaluations compared with various state-of-the-art trajectory generation models, confirm DRoPE's good performance and significantly reduced space complexity, indicating both theoretical soundness and practical effectiveness. The video documentation is available at https://drope-traj.github.io/.
Problem

Research questions and friction points this paper is trying to address.

Efficient modeling of agent interactions for autonomous driving.
Overcoming limitations in accuracy, computational time, and memory efficiency.
Improving trajectory generation with directional rotary position embedding.
Innovation

Methods, ideas, or system contributions that make the work stand out.

DRoPE enhances RoPE for agent interaction modeling.
Introduces uniform identity scalar for angular alignment.
Optimizes accuracy, time, and space complexity simultaneously.
🔎 Similar Papers
No similar papers found.
J
Jianbo Zhao
University of Science and Technology of China, 96 Jinzhai Rd, Hefei 230026, China; Mach Drive, Beijing, China
T
Taiyu Ban
University of Science and Technology of China, 96 Jinzhai Rd, Hefei 230026, China
Z
Zhihao Liu
Mach Drive, Beijing, China
H
Hangning Zhou
Mach Drive, Beijing, China
X
Xiyang Wang
Mach Drive, Beijing, China
Q
Qibin Zhou
Mach Drive, Beijing, China
Hailong Qin
Hailong Qin
NIO, Changan Automobile, Deeproute.ai,Roadstar.ai, National University of Singapore
3D ReconstructionRobot NavigationPerception and LocalizationMAV
M
Mu Yang
Mach Drive, Beijing, China
L
Lei Liu
University of Science and Technology of China, 96 Jinzhai Rd, Hefei 230026, China
B
Bin Li
University of Science and Technology of China, 96 Jinzhai Rd, Hefei 230026, China