Towards Multiple Missing Values-resistant Unsupervised Graph Anomaly Detection

📅 2025-11-13
📈 Citations: 0
Influential: 0
🤖 AI Summary
Problem: Unsupervised graph anomaly detection degrades when node attributes and structural information are missing at the same time. Method: We propose the first unified framework that is robust to multiple kinds of missing values. A dual-path encoder reconstructs attributes and structure independently, which prevents cross-view error propagation. In the joint latent space, we apply latent-space regularization and joint reconstruction learning, and we generate hard negative samples by latent-space sampling to mitigate the anomaly masking caused by imputation and to sharpen the decision boundary between normal and anomalous distributions. Results: Experiments on seven benchmark datasets show that our method consistently outperforms existing unsupervised graph anomaly detection approaches and remains robust across varying missingness rates.

📝 Abstract
Unsupervised graph anomaly detection (GAD), which aims to identify anomalous patterns in graph-structured data using only unlabeled node information, has received increasing attention in recent years. However, prevailing unsupervised GAD methods typically presuppose complete node attributes and structure information, a condition rarely satisfied in real-world scenarios owing to privacy constraints, collection errors, or dynamic node arrivals. Standard imputation schemes risk "repairing" rare anomalous nodes so that they appear normal, thereby introducing imputation bias into the detection process. In addition, when node attributes and edges are missing simultaneously, estimation errors in one view can contaminate the other, causing cross-view interference that further undermines detection performance. To overcome these challenges, we propose M$^2$V-UGAD, a multiple missing values-resistant unsupervised GAD framework for incomplete graphs. Specifically, a dual-pathway encoder first reconstructs missing node attributes and graph structure independently, thereby preventing errors in one view from propagating to the other. The two pathways are then fused and regularized in a joint latent space so that normal nodes occupy a compact inner manifold while anomalies reside on an outer shell. Lastly, to mitigate imputation bias, we sample latent codes just outside the normal region and decode them into realistic node features and subgraphs, providing hard negative examples that sharpen the decision boundary. Experiments on seven public benchmarks demonstrate that M$^2$V-UGAD consistently outperforms existing unsupervised GAD methods across varying missing rates.
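
The last step described in the abstract, sampling latent codes just outside the normal region, might look roughly like the sketch below. It is an illustration under stated assumptions, not the paper's implementation: it assumes the normal region is a hypersphere with a known center and radius, and the function name `sample_shell_codes` and the `margin` parameter are hypothetical. Decoding the sampled codes into node features and subgraphs is omitted.

```python
import numpy as np


def sample_shell_codes(center, radius, n_samples, margin=0.1, seed=0):
    """Draw latent codes in a thin shell around a hypersphere.

    Assumption (not from the paper): the normal region is the ball of
    the given radius around `center`, and `margin` is the relative
    width of the shell from which hard negatives are drawn.
    """
    rng = np.random.default_rng(seed)
    center = np.asarray(center, dtype=float)

    # Uniform random directions: normalize isotropic Gaussian draws.
    directions = rng.normal(size=(n_samples, center.shape[0]))
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)

    # Distances from the center lie between radius and radius * (1 + margin),
    # so every code sits on or just beyond the boundary of the ball.
    scales = radius * (1.0 + margin * (1.0 - rng.random(n_samples)))
    return center + directions * scales[:, None]
```

A small `margin` keeps the sampled codes close to the boundary, which is what makes them "hard" negatives in this sketch; a larger value would yield easier, more distant negatives.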
Problem

Research questions and friction points this paper is trying to address.

Detecting anomalies in graphs with missing attributes and structure
Preventing cross-view interference between node and edge reconstruction
Mitigating imputation bias that masks anomalous patterns
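
To make the problem setting concrete, the following sketch builds an incomplete graph by hiding a fraction of attribute entries and edges at random. It is an illustrative assumption rather than the paper's evaluation protocol: the function `mask_graph`, the uniform-random missingness, the use of NaN for hidden attributes, and the assumption of a symmetric adjacency matrix without self-loops are all hypothetical choices.

```python
import numpy as np


def mask_graph(features, adjacency, missing_rate, seed=0):
    """Hide attribute entries and undirected edges at a given rate.

    Assumptions (not from the paper): missingness is uniform at random,
    hidden attributes are marked with NaN, and `adjacency` is a
    symmetric matrix with an empty diagonal.
    """
    rng = np.random.default_rng(seed)
    features = np.asarray(features, dtype=float)
    adjacency = np.asarray(adjacency)

    # Attribute view: each entry is hidden independently.
    attr_mask = rng.random(features.shape) < missing_rate
    masked_features = np.where(attr_mask, np.nan, features)

    # Structure view: decide once per node pair on the upper triangle,
    # then mirror so the masked adjacency stays symmetric.
    upper = np.triu(adjacency, k=1)
    drop = rng.random(upper.shape) < missing_rate
    kept = np.where(drop, 0, upper)
    masked_adjacency = kept + kept.T

    return masked_features, masked_adjacency, attr_mask
```

Sweeping `missing_rate` over several values gives one simple way to probe how a detector behaves as the graph becomes more incomplete.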
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-pathway encoder prevents cross-view error propagation
Joint latent space separates normals and anomalies spatially
Hard negative sampling sharpens decision boundary against imputation bias
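
If normal nodes occupy a compact inner region and anomalies lie on an outer shell, one natural way to score nodes is by their distance from the center of the latent space. The sketch below shows that idea only; the summary does not state the paper's actual scoring function, so the function `distance_scores` and the choice of the mean latent code as the default center are assumptions made for illustration.

```python
import numpy as np


def distance_scores(latent_codes, center=None):
    """Score each node by its Euclidean distance from a latent center.

    Assumption (not from the paper): a larger distance means a more
    anomalous node, and the center defaults to the mean latent code.
    """
    z = np.asarray(latent_codes, dtype=float)
    if center is None:
        center = z.mean(axis=0)
    return np.linalg.norm(z - center, axis=1)


# Three codes near the origin and one far away.
z = np.array([[0.0, 0.0], [0.1, 0.0], [-0.1, 0.0], [3.0, 4.0]])
scores = distance_scores(z, center=np.zeros(2))
most_anomalous = int(np.argmax(scores))  # index of the distant code
```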
Jiazhen Chen
Department of Statistics and Actuarial Science, University of Waterloo
Xiuqin Liang
Data Science Center of Excellence, Deloitte Consulting Beijing
Sichao Fu
School of Electronic Information and Communications, Huazhong University of Science and Technology; Text Computing and Cognitive Intelligence Ministry of Education Engineering Research Center, Guizhou University
Zheng Ma
Cheriton School of Computer Science, University of Waterloo
Weihua Ou
Guizhou Normal University
Computer Vision and Artificial Intelligence