CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection

📅 2024-05-13
🏛️ IEEE International Conference on Data Engineering
📈 Citations: 3
Influential: 0
📄 PDF
🤖 AI Summary
Trajectory anomaly detection suffers from causal confounding induced by road network bias, where conventional conditional probability (P(T|C)) fails to disentangle spurious correlations between trajectories (T) and contextual features (C). Method: We propose the first do-intervention-based debiasing framework, replacing (P(T|C)) with the causal probability (P(T| ext{do}(C))) as the anomaly scoring criterion. Leveraging do-calculus—introduced here for the first time in this domain—we explicitly model the causal effect of semantic destinations (SD) on trajectories while decoupling and eliminating the road network structure as a confounder. Our approach integrates implicit generative modeling, counterfactual trajectory inference, and flow-based density estimation. Results: Experiments show consistent improvements: +2.1–5.7% AUC on in-distribution data and substantial gains of +10.6–32.7% on out-of-distribution (unseen routes) scenarios, demonstrating significantly enhanced generalization and robustness.

Technology Category

Application Category

📝 Abstract
Trajectory anomaly detection, aiming to estimate the anomaly risk of trajectories given the Source-Destination (SD) pairs, has become a critical problem for many real-world applications. Existing solutions directly train a generative model for observed trajectories and calculate the conditional generative probability $P(T vert C)$ as the anomaly risk, where $T$ and $C$ represent the trajectory and SD pair respectively. However, we argue that the observed trajectories are confounded by road network preference which is a common cause of both SD distribution and trajectories. Existing methods ignore this issue limiting their generalization ability on out-of-distribution trajectories. In this paper, we define the debiased trajectory anomaly detection problem and propose a causal implicit generative model, namely CausalTAD, to solve it. CausalTAD adopts do-calculus to eliminate the confounding bias of road network preference and estimates $P(Tvert do(C))$ as the anomaly criterion. Extensive experiments show that CausalTadcan not only achieve superior performance on trained trajectories but also generally improve the performance of out-of-distribution data, with improvements of 2.1% ~ 5.7% and 10.6% ~ 32.7% respectively.
Problem

Research questions and friction points this paper is trying to address.

Trajectory Anomaly Detection
Route Preference
Accuracy Improvement
Innovation

Methods, ideas, or system contributions that make the work stand out.

CausalTAD
Route Preference
Anomaly Detection
🔎 Similar Papers
No similar papers found.
W
Wenbin Li
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, China
Di Yao
Di Yao
Institute of Computing Technology, Chinese Academy of Sciences
Spatial-Temporal Data MiningTrajectory Data MiningGraph Neural NetworkTime-series Analysis
Chang Gong
Chang Gong
AstraZeneca
Computational BiologyImmuno-oncology
X
Xiaokai Chu
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Q
Quanliang Jing
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
X
Xiaolei Zhou
DiDi Global Inc.
Y
Yuxuan Zhang
DiDi Global Inc.
Y
Yunxia Fan
DiDi Global Inc.
J
Jingping Bi
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China