The CTU Prague Relational Learning Repository

📅 2015-11-10

📈 Citations: 0

✨ Influential: 0

career value

166K/year

🤖 AI Summary

The relational learning community lacks high-quality, standardized multi-relational benchmark datasets. Method: We construct the first systematic, open-source repository of multi-relational datasets, integrating 148 real-world SQL databases (in MySQL format) with diverse table schemas and relationship patterns. We propose a unified metadata modeling framework encompassing standardized schema descriptions, self-relationship statistics, and semantic annotations. Additionally, we provide a searchable web interface to support efficient dataset discovery and empirical evaluation. Contribution/Results: This repository fills a critical gap in benchmarking for relational learning, graph neural networks, and inductive logic programming—domains previously constrained by synthetic or narrow-scale data. It enables rigorous, reproducible model development and algorithmic assessment on authentic, heterogeneous relational structures. The resource has been widely adopted in both academic research and industrial applications for training and evaluating relational AI systems.

📝 Abstract

The aim of the Prague Relational Learning Repository is to support machine learning research with multi-relational data. The repository currently contains 148 SQL databases hosted on a public MySQL server located at https://relational.fel.cvut.cz. The server is provided by the Czech Technical University (CTU). A searchable meta-database provides metadata (e.g., the number of tables in the database, the number of rows and columns in the tables, the number of self-relationships).

Problem

Research questions and friction points this paper is trying to address.

Supports machine learning research with multi-relational data

Provides 148 SQL databases on a public MySQL server

Offers searchable metadata for relational database analysis

Innovation

Methods, ideas, or system contributions that make the work stand out.

Public MySQL server for SQL databases

Searchable meta-database with metadata

148 multi-relational datasets available

🔎 Similar Papers

No similar papers found.

ByteDance

圣何塞

Research Engineer / Scientist -AI for Databases

ByteDance

西雅图

Research Scientist Intern, Multimodal Generative AI and Robotics (PhD)