The AnIML Ontology: Enabling Semantic Interoperability for Large-Scale Experimental Data in Interconnected Scientific Labs

📅 2026-04-02
📈 Citations: 0
Influential: 0
📄 PDF

career value

187K/year
🤖 AI Summary
This study addresses semantic interoperability challenges arising from inconsistent semantic representations of the AnIML standard across heterogeneous experimental data systems. To resolve this, the work presents the first formalization of the AnIML standard as an OWL 2 ontology, aligned with the Allotrope Data Format. A collaborative ontology engineering methodology is introduced, integrating domain expert input with large language model–assisted requirements elicitation. Furthermore, a novel validation protocol is devised, leveraging adversarial negative capability queries and SHACL constraints to ensure ontological rigor. The approach successfully transforms real-world AnIML files into a knowledge graph, enabling SPARQL-based querying and knowledge graph transformation techniques that effectively preserve ontological consistency and facilitate cross-system interoperability.

Technology Category

Application Category

📝 Abstract
Achieving semantic interoperability across heterogeneous experimental data systems remains a major barrier to data-driven scientific discovery. The Analytical Information Markup Language (AnIML), a flexible XML-based standard for analytical chemistry and biology, is increasingly used in industrial R&D labs for managing and exchanging experimental data. However, the expressivity of the XML schema permits divergent interpretations across stakeholders, introducing inconsistencies that undermine the interoperability the AnIML schema was designed to support. In this paper, we present the AnIML Ontology, an OWL 2 ontology that formalises the semantics of AnIML and aligns it with the Allotrope Data Format to support future cross-system and cross-lab interoperability. The ontology was developed using an expert-in-the-loop approach combining LLM-assisted requirement elicitation with collaborative ontology engineering. We validate the ontology through a multi-layered approach: data-driven transformation of real-world AnIML files into knowledge graphs, competency question verification via SPARQL, and a novel validation protocol based on adversarial negative competency questions mapped to established ontological anti-patterns and enforced via SHACL constraints.
Problem

Research questions and friction points this paper is trying to address.

semantic interoperability
AnIML
experimental data
ontology
heterogeneous data systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

AnIML Ontology
semantic interoperability
OWL 2
LLM-assisted ontology engineering
SHACL validation
🔎 Similar Papers
No similar papers found.
💼 Related Jobs
AI Data Engineer--LLMs / Agentic Systems
Pfizer
The annual base salary for this position ranges from $106,000.00 to $176,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 15.0% of the base salary and eligibility to participate in our share based long term incentive program. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
United States - Massachusetts - Cambridge
W
Wilf Morlidge
School of Computer Science & Informatics, University of Liverpool, UK
E
Elliott Watkiss-Leek
School of Computer Science & Informatics, University of Liverpool, UK
G
George Hannah
School of Computer Science & Informatics, University of Liverpool, UK
H
Harry Rostron
Unilever Plc. Materials Innovation Factory, University of Liverpool, UK
Andrew Ng
Andrew Ng
Stanford University
Machine LearningDeep LearningAI
E
Ewan Johnson
Unilever Plc. Materials Innovation Factory, University of Liverpool, UK
A
Andrew Mitchell
Unilever Plc. Materials Innovation Factory, University of Liverpool, UK
Terry R. Payne
Terry R. Payne
University of Liverpool, Department of Computer Science, Agents Group
Artificial IntelligenceMulti Agent SystemsOntologiesSemantic Web
Valentina Tamma
Valentina Tamma
University of Liverpool, Department of Computer Science, Agents Group
ontologiesontology engineeringontologies in MASartificial intelligence
Jacopo de Berardinis
Jacopo de Berardinis
Lecturer in Computer Science, University of Liverpool
Music InformaticsKnowledge EngineeringMachine Learning