🤖 AI Summary
This work addresses the challenge of applying graph neural networks (GNNs) to tabular data, which inherently lacks a graph structure. The authors propose a novel instance-level graph construction method that leverages the proximity between samples induced by random forests, thereby utilizing tree-based model outputs to define graph edges without imposing strong assumptions on the underlying feature geometry. This approach uniquely integrates the nonlinear feature interaction capabilities of random forests with the structural learning strengths of GNNs. Evaluated across 36 benchmark datasets, the method achieves significantly higher weighted F1 scores compared to established baselines and existing graph construction strategies, demonstrating both its effectiveness and robustness.
📝 Abstract
Graphs are essential for modeling complex relationships and capturing structured interactions in data. Graph Neural Networks (GNNs) are particularly effective when such relational structure is explicitly available, but many real-world datasets, most notably tabular data, lack an inherent graph representation. To address this limitation, we propose RF-GNN, a framework that constructs instance-level graphs from tabular data using proximity measures induced by random forests. These proximities capture nonlinear feature interactions and data-adaptive similarity without imposing restrictive assumptions on feature geometry. The resulting graphs enable the direct application of GNNs to tabular learning problems. Extensive experiments on 36 benchmark datasets demonstrate that RF-GNN consistently outperforms strong classical baselines and recent graph-construction methods in terms of weighted F1-score. Additional ablation studies highlight the impact of proximity design choices and graph construction settings.