Position: Graph Condensation Needs a Reset -- Move Beyond Full-dataset Training and Model-Dependence

📅 2026-05-17
📈 Citations: 0
Influential: 0
📄 PDF

career value

227K/year
🤖 AI Summary
Existing graph neural network (GNN) compression methods typically rely on full-graph training and model-specific architectures, resulting in high computational overhead, limited generalization, and deployment challenges. This work systematically analyzes the fundamental shortcomings of current compression paradigms in methodology, evaluation protocols, and practical applicability. We propose a novel direction that entirely dispenses with full-graph training and model dependency, advocating for lightweight, architecture-agnostic graph compression techniques. Furthermore, we advocate a restructured evaluation framework centered on genuine resource savings and deployability, thereby establishing a theoretical foundation for efficient and scalable GNN training.
📝 Abstract
Graph Neural Networks (GNNs) are powerful tools for learning from graph-structured data, but their scalability is increasingly strained by the size of real-world graphs in domains like recommender systems, fraud detection, and molecular biology. Graph condensation -- the task of generating a smaller synthetic graph that retains the performance of models trained on the original -- has emerged as a promising solution. However, the dominant approach of gradient matching introduces a fundamental contradiction: it requires training on the full dataset to create the compressed version, thereby undermining the goal of efficiency. Worse still, these methods suffer from high computational overhead, poor generalization across GNN architectures, and brittle reliance on specific model configurations. Equally concerning is the community's reliance on misleading evaluation protocols such as node compression ratios, which fail to reflect true resource savings, condensation overhead, and illusory application to neural architecture search. These shortcomings are not incidental -- they are systemic, and they obstruct meaningful progress. In this position paper, we argue that graph condensation, in its current form, needs a reset. We call for moving beyond full-dataset training and model-dependent design, and instead advocate for methods that are lightweight, architecture-agnostic, and practically deployable. By identifying key methodological flaws and outlining concrete research directions, we aim to reorient the field toward approaches that deliver on the true promise of condensation: efficient, generalizable, and usable GNN training at scale.
Problem

Research questions and friction points this paper is trying to address.

graph condensation
scalability
model-dependence
evaluation protocol
computational overhead
Innovation

Methods, ideas, or system contributions that make the work stand out.

graph condensation
model-agnostic
scalable GNNs
efficient training
synthetic graph generation