The"I"in FAIR: Translating from Interoperability in Principle to Interoperation in Practice

📅 2026-01-15
📈 Citations: 0
Influential: 0
📄 PDF

career value

184K/year
🤖 AI Summary
Although scientific data increasingly adhere to the FAIR principles and employ standardized identifiers, practical interoperability remains hindered by heterogeneity in identifier systems and data models. This work proposes and implements two synergistic tools—Babel and ORION—to bridge this gap. Babel constructs clusters of equivalent identifiers through mapping-based clustering and exposes them via a high-performance quantitative API, while ORION standardizes heterogeneous knowledge bases by aligning them to a community-governed common data model. Together, they systematically address the longstanding disconnect between the FAIR “Interoperable” principle and its real-world implementation. The integration of these tools has enabled the construction of a fully interoperable knowledge base, substantially enhancing cross-resource data integration and query capabilities. The resulting framework is publicly available.

Technology Category

Application Category

📝 Abstract
The FAIR (Findable, Accessible, Interoperable, and Reusable) data principles [1] promote the interoperability of scientific data by encouraging the use of persistent identifiers, standardized vocabularies, and formal metadata structures. Many resources are created using vocabularies that are FAIR-compliant and well-annotated, yet the collective ecosystem of these resources often fails to interoperate effectively in practice. This continued challenge is mainly due to variation in identifier schemas and data models used in these resources. We have created two tools to bridge the chasm between interoperability in principle and interoperation in practice. Babel solves the problem of multiple identifier schemes by producing a curated set of identifier mappings to create cliques of equivalent identifiers that are exposed through high-performance APIs. ORION solves the problems of multiple data models by ingesting knowledge bases and transforming them into a common, community-managed data model. Here, we describe Babel and ORION and demonstrate their ability to support data interoperation. A library of fully interoperable knowledge bases created through the application of Babel and ORION is available for download and use at https://robokop.renci.org.
Problem

Research questions and friction points this paper is trying to address.

FAIR
Interoperability
Identifier Schemas
Data Models
Scientific Data
Innovation

Methods, ideas, or system contributions that make the work stand out.

FAIR data
interoperability
identifier mapping
data harmonization
knowledge integration
🔎 Similar Papers
No similar papers found.
E
Evan Morris
Renaissance Computing Institute, University of North Carolina at Chapel Hill, North Carolina, USA 27599
G
Gaurav Vaidya
Renaissance Computing Institute, University of North Carolina at Chapel Hill, North Carolina, USA 27599
P
Phil Owen
Renaissance Computing Institute, University of North Carolina at Chapel Hill, North Carolina, USA 27599
J
Jason Reilly
Renaissance Computing Institute, University of North Carolina at Chapel Hill, North Carolina, USA 27599
K
K. Fecho
Copperline Professional Solutions, LLC, Pittsboro, North Carolina, USA 27312
Patrick Wang
Patrick Wang
Adjunct Associate Professor of Electrical and Computer Engineering, Duke University
machine learningsoftware engineering
Y
Yaphet Kebede
Renaissance Computing Institute, University of North Carolina at Chapel Hill, North Carolina, USA 27599
E
E. K. Carter
Renaissance Computing Institute, University of North Carolina at Chapel Hill, North Carolina, USA 27599
C
Christopher Bizon
Renaissance Computing Institute, University of North Carolina at Chapel Hill, North Carolina, USA 27599