🤖 AI Summary
Autonomous vessels struggle to simultaneously satisfy maritime traffic regulations and maintain robust navigation policies in complex, dynamic sea environments. Method: This paper proposes a counterexample-driven reinforcement learning framework that formalizes maritime rules using Signal Temporal Logic (STL), automatically synthesizes high-risk violation scenarios as adversarial training samples, and enables closed-loop policy optimization and compliance verification under formal constraints. Contribution/Results: By embedding STL specifications directly into the training process, the framework improves the agent's understanding of, and generalization to, dynamic navigational constraints via counterexample-guided learning. In dual-vessel open-sea navigation experiments, the method achieves a 23.6% improvement in rule compliance rate, generates more challenging and contextually relevant training scenarios, and yields policies with superior safety and robustness compared to baseline approaches.
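To make the STL idea concrete, here is a minimal, illustrative sketch (not the paper's implementation) of the quantitative robustness of a simple safety rule, G (distance > d_safe), over a sampled distance trace. A positive robustness means the rule holds on the trace; a negative value is a violation, and its magnitude indicates how badly the rule was broken, which is exactly the signal a falsifier can minimize. All names and numbers below are assumptions for illustration.

```python
def robustness_always_greater(signal, threshold):
    """Robustness of the STL formula G (signal > threshold):
    min over all samples of (signal[t] - threshold)."""
    return min(s - threshold for s in signal)

# Two hypothetical distance traces between own ship and a target vessel (metres).
compliant = [500.0, 420.0, 380.0, 450.0]   # never closer than 300 m
violating = [500.0, 310.0, 250.0, 400.0]   # dips below the 300 m safety margin

D_SAFE = 300.0
print(robustness_always_greater(compliant, D_SAFE))  # 80.0  -> rule satisfied
print(robustness_always_greater(violating, D_SAFE))  # -50.0 -> counterexample
```

A trace with negative robustness is precisely the kind of rule-violating scenario the framework feeds back into training as an adversarial sample.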
📝 Abstract
Compliance with maritime traffic rules is essential for the safe operation of autonomous vessels, yet training reinforcement learning (RL) agents to adhere to them is challenging. The behavior of RL agents is shaped by the training scenarios they encounter, but creating scenarios that capture the complexity of maritime navigation is non-trivial, and real-world data alone is insufficient. To address this, we propose a falsification-driven RL approach that generates adversarial training scenarios in which the vessel under test violates maritime traffic rules, expressed as signal temporal logic (STL) specifications. Our experiments on open-sea navigation with two vessels demonstrate that the proposed approach provides more relevant training scenarios and achieves more consistent rule compliance.
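The falsification step described above can be sketched as a search over scenario parameters for one that drives the rule's robustness negative. The toy loop below uses random search over a target vessel's bearing and speed with straight-line kinematics; the dynamics, parameter ranges, and safety margin are placeholders, not the paper's setup, and a real falsifier would use a more capable optimizer.

```python
import math
import random

D_SAFE = 0.5  # safety margin in nautical miles (illustrative)

def simulate_min_distance(bearing, speed, steps=50, dt=0.1):
    """Own ship heads east at unit speed from the origin; the target vessel
    starts 2 nm ahead and moves with the given bearing and speed.
    Returns the minimum separation over the horizon."""
    ox, oy = 0.0, 0.0
    tx, ty = 2.0, 0.0
    dmin = math.hypot(tx - ox, ty - oy)
    for _ in range(steps):
        ox += 1.0 * dt
        tx += speed * math.cos(bearing) * dt
        ty += speed * math.sin(bearing) * dt
        dmin = min(dmin, math.hypot(tx - ox, ty - oy))
    return dmin

def falsify(trials=200, seed=0):
    """Random-search falsifier: minimize robustness of G (distance > D_SAFE).
    Returns (robustness, bearing, speed) of the worst scenario found."""
    rng = random.Random(seed)
    worst = None
    for _ in range(trials):
        bearing = rng.uniform(0.0, 2 * math.pi)
        speed = rng.uniform(0.5, 2.0)
        rob = simulate_min_distance(bearing, speed) - D_SAFE
        if worst is None or rob < worst[0]:
            worst = (rob, bearing, speed)
    return worst

rob, bearing, speed = falsify()
print(f"worst robustness {rob:.3f} at bearing {bearing:.2f} rad, speed {speed:.2f}")
# A scenario with negative robustness is a counterexample: the target vessel's
# motion forces a safety-margin violation, making it a useful adversarial
# training sample for the RL policy.
```

Closing the loop, the RL agent would be retrained on the scenarios the falsifier discovers, and the falsifier rerun against the updated policy.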