PEMUTA: Pedagogically-Enriched Multi-Granular Undergraduate Thesis Assessment

📅 2025-07-25
🤖 AI Summary
Current LLM-based automated evaluation of the undergraduate thesis (UGTE) typically yields a single holistic score, which fails to capture pedagogically important aspects such as structural criteria, pedagogical objectives, and diverse academic competencies. To address this, we propose PEMUTA, a pedagogically enriched framework for multi-granular UGTE assessment. Guided by Vygotsky's theory and Bloom's Taxonomy, PEMUTA uses a hierarchical prompting scheme that evaluates each thesis on six fine-grained dimensions, Structure, Logic, Originality, Writing, Proficiency, and Rigor (SLOWPR), and then synthesizes a holistic assessment. Few-shot prompting and role-play prompting further align the large language model with expert judgments without model fine-tuning. Experiments report strong agreement between PEMUTA and human experts across all dimensions (mean Cohen's κ = 0.82), supporting both the pedagogical validity and the explanatory granularity of automated thesis evaluation.
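To make the agreement figure concrete, here is a minimal sketch of how unweighted Cohen's κ can be computed between one expert and one model on a single dimension. Whether the paper uses the unweighted or a weighted variant is not stated here, so treat that choice as an assumption; the function name `cohens_kappa` and the scores are hypothetical and are not data from the paper.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length lists of categorical labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed agreement: fraction of items that both raters labelled identically.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement, from each rater's marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


# Hypothetical scores on one dimension (illustrative only).
expert = [3, 4, 4, 2, 5, 3, 4, 2]
model = [3, 4, 3, 2, 5, 3, 4, 3]
kappa = cohens_kappa(expert, model)
print(f"kappa = {kappa:.3f}")
```

For ordinal scores such as rubric grades, a weighted κ is often preferred because it penalizes a one-point disagreement less than a three-point one; the unweighted form above treats every disagreement equally.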

📝 Abstract
The undergraduate thesis (UGTE) plays an indispensable role in assessing a student's cumulative academic development throughout their college years. Although large language models (LLMs) have advanced educational intelligence, they typically perform holistic assessment that yields a single evaluation score and ignore the nuances across multifaceted criteria, which limits their ability to reflect structural criteria, pedagogical objectives, and diverse academic competencies. Meanwhile, pedagogical theories have long informed manual UGTE evaluation through multi-dimensional assessment of cognitive development, disciplinary thinking, and academic performance, yet they remain underutilized in automated settings. Motivated by this research gap, we pioneer PEMUTA, a pedagogically-enriched framework that activates domain-specific knowledge from LLMs for multi-granular UGTE assessment. Guided by Vygotsky's theory and Bloom's Taxonomy, PEMUTA incorporates a hierarchical prompting scheme that evaluates UGTEs across six fine-grained dimensions: Structure, Logic, Originality, Writing, Proficiency, and Rigor (SLOWPR), followed by holistic synthesis. Two in-context learning techniques, i.e., few-shot prompting and role-play prompting, are also incorporated to further enhance alignment with expert judgments without fine-tuning. We curate a dataset of authentic UGTEs with expert-provided SLOWPR-aligned annotations to support multi-granular UGTE assessment. Extensive experiments demonstrate that PEMUTA achieves strong alignment with expert evaluations and exhibits strong potential for fine-grained, pedagogically-informed UGTE evaluation.
Problem

Research questions and friction points this paper is trying to address.

Addresses lack of multi-granular assessment in UGTE evaluations
Integrates pedagogical theories into automated LLM-based assessment
Enhances alignment with expert judgments using hierarchical prompting
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical prompting scheme for multi-granular assessment
Incorporates few-shot and role-play prompting techniques
Leverages Vygotsky's theory and Bloom's Taxonomy
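The combination listed above (per-dimension prompts, role-play, few-shot examples, then holistic synthesis) can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the prompt wording, the 1 to 5 scale, the function names, and the `llm` callable are all hypothetical, and `fake_llm` is a stand-in so the sketch runs without a real model.

```python
from typing import Callable, Dict, List, Tuple

# The six SLOWPR dimensions named in the abstract.
SLOWPR = ["Structure", "Logic", "Originality", "Writing", "Proficiency", "Rigor"]

Example = Tuple[str, int]  # (thesis excerpt, expert score); format is an assumption


def build_dimension_prompt(dimension: str, thesis: str, examples: List[Example]) -> str:
    """Role-play preamble, then few-shot examples, then the thesis to score."""
    parts = [
        # Role-play prompting (wording is hypothetical).
        f"You are an experienced thesis examiner assessing {dimension}.",
        # The 1-5 scale is an assumption, not taken from the paper.
        "Score the thesis from 1 to 5 and reply with the number only.",
    ]
    for excerpt, score in examples:  # few-shot prompting
        parts.append(f"Example thesis: {excerpt}\nScore: {score}")
    parts.append(f"Thesis: {thesis}\nScore:")
    return "\n\n".join(parts)


def assess(thesis: str, llm: Callable[[str], str],
           examples: Dict[str, List[Example]]) -> Dict[str, int]:
    """Score each dimension separately, then ask for a holistic score."""
    scores: Dict[str, int] = {}
    for dim in SLOWPR:
        prompt = build_dimension_prompt(dim, thesis, examples.get(dim, []))
        scores[dim] = int(llm(prompt).strip())
    # Holistic synthesis: the second level sees only the dimension scores.
    summary = ", ".join(f"{d}: {s}" for d, s in scores.items())
    holistic_prompt = (
        "You are the chair of a thesis committee.\n"
        f"Dimension scores: {summary}\n"
        "Give one overall score from 1 to 5 and reply with the number only."
    )
    scores["Holistic"] = int(llm(holistic_prompt).strip())
    return scores


def fake_llm(prompt: str) -> str:
    """Stand-in for a real model call; always answers 4."""
    return "4"


result = assess(
    "This thesis studies retinal image segmentation.",
    fake_llm,
    {"Logic": [("Each claim follows from the reported evidence.", 5)]},
)
print(result)
```

Keeping the model behind a plain callable means the same scaffolding can be pointed at any LLM client, and a real deployment would also need to handle replies that are not a bare integer.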
Jialu Zhang
Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong 518055, China
Qingyang Sun
Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong 518055, China
Qianyi Wang
Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong 518055, China
Weiyi Zhang
Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong 518055, China
Zunjie Xiao
Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong 518055, China
Xiaoqing Zhang
Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong 518055, China
Jianfeng Ren
University of Nottingham Ningbo China
Computer Vision, Pattern Recognition, Machine Learning, Human-Computer Interaction
Jiang Liu
Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Guangdong 518055, China; School of Computer Science, University of Nottingham Ningbo China, Zhejiang 315100, China; School of Ophthalmology and Optometry, Wenzhou Medical University, Zhejiang 325035, China