Automatic Classification of Pedagogical Materials against CS Curriculum Guidelines

📅 2026-02-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes an automated classification approach that integrates traditional natural language processing techniques—such as syntactic parsing, part-of-speech tagging, and text embeddings—with large language models to efficiently align computer science course materials with the ACM/IEEE curriculum guidelines. Manual evaluation of such alignment is time-consuming and cognitively demanding; in contrast, the proposed method enables precise semantic-level categorization of instructional documents, significantly enhancing the efficiency of curriculum audits. By automating the mapping between course content and internationally recognized CS education standards, this approach offers a scalable technical solution to support quality assurance in computing education.

Technology Category

Application Category

📝 Abstract
Professional societies often publish curriculum guidelines to help programs align their content to international standards. In Computer Science, the primary standard is published by ACM and IEEE and provide detailed guidelines for what should be and could be included in a Computer Science program. While very helpful, it remains difficult for program administrators to assess how much of the guidelines is being covered by a CS program. This is in particular due to the extensiveness of the guidelines, containing thousands of individual items. As such, it is time consuming and cognitively demanding to audit every course to confidently mark everything that is actually being covered. Our preliminary work indicated that it takes about a day of work per course. In this work, we propose using Natural Language Processing techniques to accelerate the process. We explore two kinds of techniques, the first relying on traditional tools for parsing, tagging, and embeddings, while the second leverages the power of Large Language Models. We evaluate the application of these techniques to classify a corpus of pedagogical materials and show that we can meaningfully classify documents automatically.
Problem

Research questions and friction points this paper is trying to address.

Curriculum Alignment
Pedagogical Materials
CS Curriculum Guidelines
Course Coverage Assessment
Educational Standards
Innovation

Methods, ideas, or system contributions that make the work stand out.

Natural Language Processing
Curriculum Alignment
Large Language Models
Automatic Classification
CS Education
🔎 Similar Papers
No similar papers found.
Erik Saule
Erik Saule
Professor of Computer Science, UNC Charlotte
High Performance ComputingSchedulingGraph AlgorithmsApproximation AlgorithmsMulti Objective
K
K. Subramanian
Department of Computer Science, The University of North Carolina at Charlotte
R
Razvan C. Bunescu
Department of Computer Science, The University of North Carolina at Charlotte