Clinical Data Goes MEDS? Let's OWL make sense of it

📅 2026-01-07
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the lack of standardization and explicit semantic representation in clinical data, which hinders interoperability and reproducibility in machine learning. To overcome this limitation, the authors propose the first integration of the MEDS clinical event model with Semantic Web technologies, resulting in MEDS-OWL—a lightweight OWL ontology comprising 13 classes, 10 object properties, 20 data properties, and 24 axioms. They further develop the meds2rdf tool to automatically transform MEDS data into FAIR-aligned RDF graphs. The approach leverages SHACL constraints for validation and RDF graph representations to provide a reusable semantic layer that enables semantic enrichment, cross-system interoperability, and graph-based analytics of clinical data. The methodology is successfully validated on a synthetic dataset capturing care pathways of patients with ruptured aneurysms.

Technology Category

Application Category

📝 Abstract
The application of machine learning on healthcare data is often hindered by the lack of standardized and semantically explicit representation, leading to limited interoperability and reproducibility across datasets and experiments. The Medical Event Data Standard (MEDS) addresses these issues by introducing a minimal, event-centric data model designed for reproducible machine-learning workflows from health data. However, MEDS is defined as a data-format specification and does not natively provide integration with the Semantic Web ecosystem. In this article, we introduce MEDS-OWL, a lightweight OWL ontology that provides formal concepts and relations to represent MEDS datasets as RDF graphs. Additionally, we implemented meds2rdf, a Python conversion library that transforms MEDS events into RDF graphs, ensuring conformance with the ontology. We evaluate the proposed approach on two datasets: a synthetic clinical cohort describing care pathways for ruptured intracranial aneurysms, and a real-world subset of MIMIC-IV. To assess semantic consistency, we performed a SHACL validation against the resulting knowledge graphs. The first release of MEDS-OWL comprises 13 classes, 10 object properties, 20 data properties, and 24 OWL axioms. Combined with meds2rdf, it enables data transformation into FAIR-aligned datasets, provenance-aware publishing, and interoperability of event-based clinical data. By bridging MEDS with the Semantic Web, this work contributes a reusable semantic layer for event-based clinical data and establishes a robust foundation for subsequent graph-based analytics.
Problem

Research questions and friction points this paper is trying to address.

clinical data
semantic interoperability
MEDS
Semantic Web
FAIR data
Innovation

Methods, ideas, or system contributions that make the work stand out.

MEDS-OWL
Semantic Web
RDF
FAIR data
clinical event data
🔎 Similar Papers
No similar papers found.