🤖 AI Summary
This study addresses the challenge of automatically extracting georeferenced standard shipping routes from massive Automatic Identification System (AIS) data to support vessel behavior modeling and anomaly detection (e.g., smuggling, regulatory evasion).
Method: We propose a fully unsupervised framework integrating Finite State Machine (FSM)-driven trajectory segmentation, iterative DBSCAN-based density clustering, and linear regression for adaptive parameter generalization and geospatial trajectory aggregation.
Contribution/Results: Unlike supervised or heuristic approaches, our method requires no labeled data and supports dynamic scalability. It is the first to achieve large-scale standard route modeling across the Arctic, Europe, Africa, and the Middle East. Evaluated on 1.15 TB of global AIS data spanning six years, it achieves <5% extraction anomaly rate, comprehensively covering major international shipping lanes. The framework provides a scalable, robust technical foundation for maritime regulatory enforcement and intelligent maritime analytics.
📝 Abstract
Maritime AIS (Automatic Identification Systems) data serve as a valuable resource for studying vessel behavior. This study proposes a methodology to analyze route between maritime points of interest and extract geo-referenced standard routes, as maritime patterns of life, from raw AIS data. The underlying assumption is that ships adhere to consistent patterns when travelling in certain maritime areas due to geographical, environmental, or economic factors. Deviations from these patterns may be attributed to weather conditions, seasonality, or illicit activities. This enables maritime surveillance authorities to analyze the navigational behavior between ports, providing insights on vessel route patterns, possibly categorized by vessel characteristics (type, flag, or size). Our methodological process begins by segmenting AIS data into distinct routes using a finite state machine (FSM), which describes routes as seg-ments connecting pairs of points of interest. The extracted segments are ag-gregated based on their departure and destination ports and then modelled using iterative density-based clustering to connect these ports. The cluster-ing parameters are assigned manually to sample and then extended to the en-tire dataset using linear regression. Overall, the approach proposed in this paper is unsupervised and does not require any ground truth to be trained. The approach has been tested on data on the on a six-year AIS dataset cover-ing the Arctic region and the Europe, Middle East, North Africa areas. The total size of our dataset is 1.15 Tbytes. The approach has proved effective in extracting standard routes, with less than 5% outliers, mostly due to routes with either their departure or their destination port not included in the test areas.