CyberBOT: Towards Reliable Cybersecurity Education via Ontology-Grounded Retrieval Augmented Generation

📅 2025-04-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) often generate inaccurate or unsafe responses when applied to cybersecurity education, posing risks to pedagogical integrity and student safety. Method: This paper proposes a multi-stage Retrieval-Augmented Generation (RAG) question-answering system integrated with a domain-specific cybersecurity ontology. The ontology serves as both a reasoning constraint and a verification layer within the RAG pipeline, enabling semantic consistency checking and security compliance validation. The system synergistically combines course material retrieval, domain-adapted LLM fine-tuning, formal ontology modeling, and rule-driven response verification. Contribution/Results: Deployed in a graduate-level cybersecurity course at Arizona State University involving over 100 students, the system demonstrates significant improvements in answer credibility and instructional relevance. It establishes a reproducible methodology and practical paradigm for deploying trustworthy AI in high-stakes technical education domains.

Technology Category

Application Category

📝 Abstract
Advancements in large language models (LLMs) have enabled the development of intelligent educational tools that support inquiry-based learning across technical domains. In cybersecurity education, where accuracy and safety are paramount, systems must go beyond surface-level relevance to provide information that is both trustworthy and domain-appropriate. To address this challenge, we introduce CyberBOT, a question-answering chatbot that leverages a retrieval-augmented generation (RAG) pipeline to incorporate contextual information from course-specific materials and validate responses using a domain-specific cybersecurity ontology. The ontology serves as a structured reasoning layer that constrains and verifies LLM-generated answers, reducing the risk of misleading or unsafe guidance. CyberBOT has been deployed in a large graduate-level course at Arizona State University (ASU), where more than one hundred students actively engage with the system through a dedicated web-based platform. Computational evaluations in lab environments highlight the potential capacity of CyberBOT, and a forthcoming field study will evaluate its pedagogical impact. By integrating structured domain reasoning with modern generative capabilities, CyberBOT illustrates a promising direction for developing reliable and curriculum-aligned AI applications in specialized educational contexts.
Problem

Research questions and friction points this paper is trying to address.

Ensuring accurate and safe cybersecurity education via AI
Combining retrieval-augmented generation with domain-specific ontology
Reducing misleading guidance in technical learning tools
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses retrieval-augmented generation (RAG) pipeline
Validates responses via cybersecurity ontology
Integrates structured reasoning with generative AI
🔎 Similar Papers
No similar papers found.
Chengshuai Zhao
Chengshuai Zhao
CS Ph.D. @ Arizona State University
Data MiningAI for ScienceLarge Language Model
R
Riccardo De Maria
School of Computing and Augmented Intelligence, Arizona State University
Tharindu Kumarage
Tharindu Kumarage
Applied Scientist, Amazon AGI Foundations
Responsible AIAI ForensicsAI for Social GoodNLP for Mental HealthAnomaly Detection
K
Kumar Satvik Chaudhary
School of Computing and Augmented Intelligence, Arizona State University
G
Garima Agrawal
School of Computing and Augmented Intelligence, Arizona State University
Y
Yiwen Li
Mary Lou Fulton Teachers College, Arizona State University
Jongchan Park
Jongchan Park
Lunit Inc.
Yuli Deng
Yuli Deng
Arizona State University
Computer Science
Y
Ying-Chih Chen
Mary Lou Fulton Teachers College, Arizona State University
H
Huan Liu
School of Computing and Augmented Intelligence, Arizona State University