Research Knowledge Graphs in NFDI4DataScience: Key Activities, Achievements, and Future Directions

📅 2025-08-04
📈 Citations: 0
✨ Influential: 0
🤖 AI Summary
To address low transparency, poor reproducibility, and weak discoverability in AI and data science research, this paper proposes and implements a semantic-driven research knowledge graph framework. Methodologically, we design and deploy the NFDI4DS ontology and standardized metadata schema, integrating community-shared vocabularies, automated information extraction techniques, and a modular knowledge graph construction pipeline to enable cross-modal semantic integration of datasets, models, software, and publications. Key contributions include: (1) the first community-developed, full-stack ontology for data science artifacts; (2) a scalable, FAIR-compliant architecture for knowledge interlinking; and (3) an open-source toolchain already adopted in multiple real-world research projects. Evaluation results demonstrate significant improvements in machine interpretability, cross-platform interoperability, and computational reproducibility of research assets.

📝 Abstract
As research in Artificial Intelligence and Data Science grows in volume and complexity, ensuring transparency, reproducibility, and discoverability becomes increasingly difficult. Addressing these challenges requires research artifacts that machines can understand and use, so the NFDI4DataScience consortium develops and provides Research Knowledge Graphs (RKGs). Building on earlier work, this paper presents recent progress in creating semantically rich RKGs using standardized ontologies, shared vocabularies, and automated Information Extraction techniques. Key achievements include the NFDI4DS ontology, metadata standards, tools, and services designed to support the FAIR principles, as well as community-led projects and several RKG implementations. Together, these efforts aim to capture and connect the complex relationships between datasets, models, software, and scientific publications.
Problem

Research questions and friction points this paper is trying to address.

Ensuring transparency and reproducibility in AI research
Developing machine-understandable research knowledge graphs
Connecting datasets, models, and publications semantically
Innovation

Methods, ideas, or system contributions that make the work stand out.

Research Knowledge Graphs using standardized ontologies
Automated Information Extraction techniques
FAIR principles with metadata standards
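
To make the idea of connecting datasets, models, software, and publications concrete, the following is a minimal sketch of a knowledge graph held as subject-predicate-object triples. It is an illustration only: the `https://example.org/rkg/` namespace, the `ex:` class and predicate names, and the helper functions are hypothetical and are not taken from the NFDI4DS ontology or its tooling.

```python
from collections import defaultdict

# Hypothetical namespace and vocabulary, chosen for illustration only;
# these are NOT terms from the NFDI4DS ontology.
EX = "https://example.org/rkg/"

TRIPLES = [
    (EX + "paper/1", "rdf:type", "ex:Publication"),
    (EX + "dataset/1", "rdf:type", "ex:Dataset"),
    (EX + "model/1", "rdf:type", "ex:Model"),
    (EX + "software/1", "rdf:type", "ex:Software"),
    (EX + "paper/1", "ex:describes", EX + "model/1"),
    (EX + "model/1", "ex:trainedOn", EX + "dataset/1"),
    (EX + "model/1", "ex:implementedIn", EX + "software/1"),
]


def build_index(triples):
    """Index triples by subject so outgoing links can be followed quickly."""
    index = defaultdict(list)
    for subject, predicate, obj in triples:
        index[subject].append((predicate, obj))
    return index


def reachable_artifacts(index, start):
    """Return every node reachable from `start` via non-type links (depth-first)."""
    seen, stack = set(), [start]
    while stack:
        node = stack.pop()
        for predicate, obj in index.get(node, []):
            if predicate != "rdf:type" and obj not in seen:
                seen.add(obj)
                stack.append(obj)
    return seen


index = build_index(TRIPLES)
linked = reachable_artifacts(index, EX + "paper/1")
print(sorted(linked))
```

Starting from the publication, the traversal reaches the model it describes and, through the model, the dataset and software. This kind of machine-followable link is what an RKG is meant to provide. A production graph would more likely use an RDF store and a published vocabulary than plain tuples.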
Authors

Kanishka Silva
GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany

Marcel R. Ackermann
DBLP computer science bibliography, Schloss Dagstuhl - LZI, Trier, Germany

Heike Fliegl
Senior Researcher at FIZ Karlsruhe
Information Service Engineering, Theoretical Chemistry, Magnetically Induced Ring Currents

Genet-Asefa Gesese
FIZ Karlsruhe – Leibniz Institute for Information Infrastructure GmbH, Eggenstein-Leopoldshafen, Germany

Fidan Limani
ZBW – Leibniz Information Centre for Economics, Kiel, Germany

Philipp Mayr
GESIS - Leibniz Institute for the Social Sciences
Interactive Information Retrieval, Informetrics, Digital libraries, Information Seeking, Dataset Search

Peter Mutschke
GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany

Allard Oelen
TIB – Leibniz Information Centre for Science and Technology, Hannover, Germany

Muhammad Asif Suryani
GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany

Sharmila Upadhyaya
GESIS Leibniz Institute

Benjamin Zapilko
GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany

Harald Sack
FIZ Karlsruhe - Leibniz Institute for Information Infrastructure & Karlsruhe Institute for
Semantic Web, Knowledge Engineering, Multimedia Retrieval, Data Mining, Digital Archives

Stefan Dietze
Full Professor (Heinrich-Heine-University Düsseldorf) & Scientific Director (KTS, GESIS)
Knowledge Graphs, Information Retrieval, Web Science, NLP