Learning to Explore: Policy-Guided Outlier Synthesis for Graph Out-of-Distribution Detection

📅 2026-02-28
📈 Citations: 0
Influential: 0
🤖 AI Summary
This work addresses the limitations of unsupervised graph-level out-of-distribution (OOD) detection, where reliance solely on in-distribution data leads to insufficient characterization of the feature space and ambiguous decision boundaries. To overcome these challenges, the authors propose PGOS, a policy-guided outlier synthesis framework that introduces, for the first time, a learnable reinforcement learning exploration policy into graph OOD detection. By deploying an agent that actively explores low-density regions in a structured latent space, PGOS adaptively generates high-quality pseudo-OOD graphs to refine decision boundaries. Integrating graph neural networks, reinforcement learning, latent space modeling, and graph decoding techniques, PGOS establishes an end-to-end anomaly synthesis pipeline. The method achieves state-of-the-art performance across multiple graph OOD and anomaly detection benchmarks, significantly enhancing detection robustness.

📝 Abstract
Detecting out-of-distribution (OOD) graphs is crucial for ensuring the safety and reliability of Graph Neural Networks. In unsupervised graph-level OOD detection, models are typically trained using only in-distribution (ID) data, resulting in incomplete feature space characterization and weak decision boundaries. Although synthesizing outliers offers a promising solution, existing approaches rely on fixed, non-adaptive sampling heuristics (e.g., distance- or density-based), limiting their ability to explore informative OOD regions. We propose a Policy-Guided Outlier Synthesis (PGOS) framework that replaces static heuristics with a learned exploration strategy. Specifically, PGOS trains a reinforcement learning agent to navigate low-density regions in a structured latent space and sample representations that most effectively refine the OOD decision boundary. These representations are then decoded into high-quality pseudo-OOD graphs to improve detector robustness. Extensive experiments demonstrate that PGOS achieves state-of-the-art performance on multiple graph OOD and anomaly detection benchmarks.
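The exploration idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a 2-D latent space, a Gaussian kernel density estimate standing in for the ID density model, and a plain REINFORCE update on the mean of a Gaussian sampling policy. All names (`log_density`, `reward`, `train_policy`, `max_radius`, `penalty`) are hypothetical, and the graph encoder and decoder are omitted entirely.

```python
import numpy as np


def log_density(z, id_latents, bandwidth=0.5):
    """Unnormalised log kernel-density estimate of the ID latents at each row of z."""
    sq_dist = ((z[:, None, :] - id_latents[None, :, :]) ** 2).sum(axis=-1)
    log_k = -sq_dist / (2.0 * bandwidth ** 2)
    peak = log_k.max(axis=1, keepdims=True)
    # log-mean-exp over ID points, shifted by the row maximum for numerical stability
    return (peak + np.log(np.exp(log_k - peak).mean(axis=1, keepdims=True))).ravel()


def reward(z, id_latents, center, max_radius=4.0, penalty=10.0):
    """Assumed reward: prefer low ID density, but penalise drifting far from the ID data."""
    dist = np.linalg.norm(z - center, axis=1)
    too_far = np.maximum(0.0, dist - max_radius) ** 2
    return -log_density(z, id_latents) - penalty * too_far


def train_policy(id_latents, steps=400, batch=64, sigma=0.3, lr=0.05, seed=0):
    """REINFORCE on the mean of a Gaussian policy that samples latent points."""
    rng = np.random.default_rng(seed)
    center = id_latents.mean(axis=0)
    mu = center.copy()  # exploration starts at the centre of the ID latents
    for _ in range(steps):
        z = mu + sigma * rng.standard_normal((batch, mu.size))
        r = reward(z, id_latents, center)
        advantage = r - r.mean()  # batch-mean baseline to reduce variance
        # score-function gradient of the expected reward with respect to mu
        grad = (advantage[:, None] * (z - mu)).mean(axis=0) / sigma ** 2
        mu += lr * grad
    return mu


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    id_latents = rng.standard_normal((500, 2))  # stand-in for encoded ID graphs
    mu = train_policy(id_latents)
    # pseudo-OOD latents are drawn around the learned policy mean
    pseudo_ood = mu + 0.3 * rng.standard_normal((16, 2))
    print("policy mean:", mu)
    print("mean log-density of samples:", log_density(pseudo_ood, id_latents).mean())
```

In this toy setup the policy mean drifts from the dense centre of the ID latents toward a sparse shell near the edge of the data, which is the kind of region a fixed distance- or density-based heuristic would have to be hand-tuned to reach. In a full pipeline the sampled latents would then be passed to a graph decoder, a step this sketch does not attempt.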
Problem

Research questions and friction points this paper is trying to address.

out-of-distribution detection
graph neural networks
unsupervised learning
outlier synthesis
decision boundary
Innovation

Methods, ideas, or system contributions that make the work stand out.

Policy-Guided Outlier Synthesis
Graph Out-of-Distribution Detection
Reinforcement Learning
Latent Space Exploration
Pseudo-OOD Graph Generation
Li Sun
East China Normal University
Image processing, Computer vision
Lanxu Yang
North China Electric Power University
Jiayu Tian
Beijing University of Posts and Telecommunications
Bowen Fang
Tsinghua University
Xiaoyan Yu
Beijing Institute of Technology
Junda Ye
Beijing University of Posts and Telecommunications
Peng Tang
Meta
Multi-modal LLM, Vision Language, Computer Vision
Hao Peng
Beihang University, Professor
Social Event Detection, Anomaly Detection, Reinforcement Learning
Philip S. Yu
Professor of Computer Science, University of Illinois at Chicago
Data mining, Database, Privacy