A Socratic RAG Approach to Connect Natural Language Queries on Research Topics with Knowledge Organization Systems

📅 2025-02-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the imprecise mapping between natural-language research queries and fine-grained semantic entities in Knowledge Organization Systems (KOS), aiming to bridge the semantic gap between users’ intuitive understanding and large-scale bibliometric repositories. To this end, we propose a novel Socratic-dialogue-driven Retrieval-Augmented Generation (RAG) framework that iteratively refines query semantics through multi-turn clarification and augments generation with targeted KOS entity retrieval, enabling high-precision, interpretable natural-language-to-KOS entity mapping. Leveraging this framework, we construct CollabNext—a human-centered knowledge graph specifically designed for Historically Black Colleges and Universities (HBCUs) and emerging scholars—to enhance visibility and traceable collaboration for historically marginalized academic communities. Experimental results demonstrate significant improvements in cross-disciplinary topic discovery accuracy and scholarly relationship inference, establishing a new paradigm for equitable, transparent, and explainable intelligent scholarly services.

Technology Category

Application Category

📝 Abstract
In this paper, we propose a Retrieval Augmented Generation (RAG) agent that maps natural language queries about research topics to precise, machine-interpretable semantic entities. Our approach combines RAG with Socratic dialogue to align a user's intuitive understanding of research topics with established Knowledge Organization Systems (KOSs). The proposed approach will effectively bridge"little semantics"(domain-specific KOS structures) with"big semantics"(broad bibliometric repositories), making complex academic taxonomies more accessible. Such agents have the potential for broad use. We illustrate with a sample application called CollabNext, which is a person-centric knowledge graph connecting people, organizations, and research topics. We further describe how the application design has an intentional focus on HBCUs and emerging researchers to raise visibility of people historically rendered invisible in the current science system.
Problem

Research questions and friction points this paper is trying to address.

Map natural language queries to semantic entities
Align user understanding with Knowledge Organization Systems
Bridge domain-specific KOS with broad repositories
Innovation

Methods, ideas, or system contributions that make the work stand out.

RAG agent maps queries
Socratic dialogue aligns understanding
Bridges domain KOS with repositories
🔎 Similar Papers
No similar papers found.
L
L. Lefton
Georgia Institute of Technology
Kexin Rong
Kexin Rong
School of Computer Science, Georgia Institute of Technology
DataData ManagementData Systems
C
Chinar Dankhara
Georgia Institute of Technology
L
Lila Ghemri
Texas Southern University
Firdous Kausar
Firdous Kausar
School of Applied Computational Sciences, Meharry Medical College
BlockchainIoTMachine LearningDigital ForensicsCyber Security
A
A. H. Hamdallahi
Fisk University