Can TabPFN Compete with GNNs for Node Classification via Graph Tabularization?

๐Ÿ“… 2025-12-09
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Graph node classification typically relies on graph neural networks (GNNs) or large language models, which require architecture-specific design and extensive training. Method: This paper proposes TabPFN-GN, the first framework to systematically reformulate node classification as a tabular learning taskโ€”leveraging the zero-shot generalization capability of the pre-trained tabular foundation model TabPFN. It constructs transferable graph-to-tabular representations by jointly encoding node attributes, structural topology, positional embeddings, and neighborhood-smoothed features, enabling end-to-end zero-shot inference without graph-specific architectures. Contribution/Results: TabPFN-GN establishes a novel paradigm for adapting general-purpose tabular models to graph data, eliminating dependence on GNNs or LLMs. Experiments across 12 benchmark datasets demonstrate that it matches state-of-the-art GNNs on homophilic graphs and consistently outperforms them on heterophilic graphs, validating the effectiveness and robustness of cross-domain zero-shot transfer.

Technology Category

Application Category

๐Ÿ“ Abstract
Foundation models pretrained on large data have demonstrated remarkable zero-shot generalization capabilities across domains. Building on the success of TabPFN for tabular data and its recent extension to time series, we investigate whether graph node classification can be effectively reformulated as a tabular learning problem. We introduce TabPFN-GN, which transforms graph data into tabular features by extracting node attributes, structural properties, positional encodings, and optionally smoothed neighborhood features. This enables TabPFN to perform direct node classification without any graph-specific training or language model dependencies. Our experiments on 12 benchmark datasets reveal that TabPFN-GN achieves competitive performance with GNNs on homophilous graphs and consistently outperforms them on heterophilous graphs. These results demonstrate that principled feature engineering can bridge the gap between tabular and graph domains, providing a practical alternative to task-specific GNN training and LLM-dependent graph foundation models.
Problem

Research questions and friction points this paper is trying to address.

Reformulates graph node classification as tabular learning problem
Introduces TabPFN-GN to transform graph data into tabular features
Evaluates performance against GNNs on homophilous and heterophilous graphs
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transforms graph data into tabular features
Enables direct node classification without graph-specific training
Uses feature engineering to bridge tabular and graph domains
๐Ÿ”Ž Similar Papers
No similar papers found.