๐ค AI Summary
The topological deep learning (TDL) field lacks a unified benchmarking framework, hindering model comparability and research reproducibility.
Method: We propose TDL-Benchโthe first modular, open-source benchmarking framework for TDL. It decouples the full pipeline into data loading, topological domain transformations (e.g., graphs โ simplicial complexes โ cell complexes), feature lifting, model training, and evaluation. Crucially, it introduces a novel operator library supporting reversible mappings between arbitrary topological domains and high-order structural embeddings. It integrates simplicial and cell complex embedding modules alongside a standardized evaluation protocol.
Results: We systematically evaluate mainstream TDL models across multiple datasets. Experiments demonstrate that TDL-Bench significantly enhances representation granularity and analytical flexibility, accelerating TDL algorithm development, validation, and deployment. By establishing a standardized, extensible benchmarking paradigm, TDL-Bench lays the foundation for rigorous, reproducible progress in topological deep learning.
๐ Abstract
This work introduces TopoBenchmarkX, a modular open-source library designed to standardize benchmarking and accelerate research in Topological Deep Learning (TDL). TopoBenchmarkX maps the TDL pipeline into a sequence of independent and modular components for data loading and processing, as well as model training, optimization, and evaluation. This modular organization provides flexibility for modifications and facilitates the adaptation and optimization of various TDL pipelines. A key feature of TopoBenchmarkX is that it allows for the transformation and lifting between topological domains. This enables, for example, to obtain richer data representations and more fine-grained analyses by mapping the topology and features of a graph to higher-order topological domains such as simplicial and cell complexes. The range of applicability of TopoBenchmarkX is demonstrated by benchmarking several TDL architectures for various tasks and datasets.