IDEA2: Expert-in-the-loop competency question elicitation for collaborative ontology engineering

📅 2026-04-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the resource-intensive bottleneck in ontology engineering caused by poor communication between domain experts and ontology engineers during competency question (CQ) elicitation. The authors propose a semi-automated workflow that integrates large language models (LLMs) into an expert-in-the-loop process: initial CQs are automatically generated from requirement documents, then iteratively refined by the LLM based on expert feedback provided through an interactive platform until consensus is reached. This approach introduces, for the first time, an iterative CQ generation mechanism combining LLM capabilities with expert collaboration, complemented by a provenance model that fully traces the CQ lifecycle to ensure transparency and reproducibility. Evaluations in real-world scientific data and cultural heritage scenarios demonstrate that the method significantly accelerates requirements engineering and enhances CQ acceptability, relevance, and usability for domain experts.
📝 Abstract
Competency question (CQ) elicitation represents a critical but resource-intensive bottleneck in ontology engineering. This foundational phase is often hampered by the communication gap between domain experts, who possess the necessary knowledge, and ontology engineers, who formalise it. This paper introduces IDEA2, a novel, semi-automated workflow that integrates Large Language Models (LLMs) within a collaborative, expert-in-the-loop process to address this challenge. The methodology is characterised by a core iterative loop: an initial LLM-based extraction of CQs from requirement documents, a co-creational review and feedback phase by domain experts on an accessible collaborative platform, and an iterative, feedback-driven reformulation of rejected CQs by an LLM until consensus is achieved. To ensure transparency and reproducibility, the entire lifecycle of each CQ is tracked using a provenance model that captures the full lineage of edits, anonymised feedback, and generation parameters. The workflow was validated in 2 real-world scenarios (scientific data, cultural heritage), demonstrating that IDEA2 can accelerate the requirements engineering process, improve the acceptance and relevance of the resulting CQs, and exhibit high usability and effectiveness among domain experts. We release all code and experiments at https://github.com/KE-UniLiv/IDEA2
Problem

Research questions and friction points this paper is trying to address.

competency question
ontology engineering
expert-in-the-loop
requirements elicitation
communication gap
Innovation

Methods, ideas, or system contributions that make the work stand out.

Competency Question Elicitation
Expert-in-the-loop
Large Language Models
Ontology Engineering
Provenance Tracking
🔎 Similar Papers
No similar papers found.
E
Elliott Watkiss-Leek
School of Computer Science and Informatics, University of Liverpool, Brownlow Hill, Liverpool, L69 7ZX, UK
Reham Alharbi
Reham Alharbi
Lecturer at the Department of Information Systems, Taibah University
OntologiesOntology engineeringHealth informaticsArtificial intelligence
H
Harry Rostron
Unilever Plc., Materials Innovation Factory, University of Liverpool, 51 Oxford Street, Liverpool, L7 3NY, UK
Andrew Ng
Andrew Ng
Stanford University
Machine LearningDeep LearningAI
E
Ewan Johnson
Unilever Plc., Materials Innovation Factory, University of Liverpool, 51 Oxford Street, Liverpool, L7 3NY, UK
A
Andrew Mitchell
Unilever Plc., Materials Innovation Factory, University of Liverpool, 51 Oxford Street, Liverpool, L7 3NY, UK
Terry R. Payne
Terry R. Payne
University of Liverpool, Department of Computer Science, Agents Group
Artificial IntelligenceMulti Agent SystemsOntologiesSemantic Web
Valentina Tamma
Valentina Tamma
University of Liverpool, Department of Computer Science, Agents Group
ontologiesontology engineeringontologies in MASartificial intelligence
Jacopo de Berardinis
Jacopo de Berardinis
Lecturer in Computer Science, University of Liverpool
Music InformaticsKnowledge EngineeringMachine Learning