Are Video Models Zero-Shot Learners and Reasoners in Education? EduVideoBench, A Knowledge-Skills-Attitude Benchmark for Educational Video Generation

📅 2026-05-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the lack of systematic evaluation of educational efficacy in existing video generation models, which predominantly focus on perceptual quality or general safety. The authors propose EduVideoBench—the first benchmark for evaluating educational video generation grounded in the Knowledge-Skills-Attitudes (KSA) framework from educational theory. By integrating KSA into generative model assessment, they establish a multidimensional, structured evaluation system for instructional appropriateness and educational safety. Through expert review and qualitative analysis, five state-of-the-art models are systematically assessed, revealing significant deficiencies in knowledge accuracy, skill demonstration, and attitudinal appropriateness. The findings indicate that misalignment in any single KSA dimension can render the generated content educationally ineffective, underscoring a substantial gap between current models and practical classroom deployment.
📝 Abstract
Video generation models (VGMs) are rapidly entering classrooms, yet existing benchmarks evaluate only perceptual quality, intrinsic faithfulness, generic safety, or video as a reasoning medium, and none assesses whether the outputs are educationally valid. In this work, we present EduVideoBench, the first balanced benchmark in the education domain, grounded in the Knowledge-Skills-Attitude (KSA) framework so that pedagogical adequacy and educational safety are evaluated jointly rather than as ad-hoc quality dimensions. Across five frontier VGMs, our results show substantial room for improvement across knowledge, skills, and attitude before they are classroom-ready. We complement this with a qualitative analysis of expert comments, finding that educational validity is multi-component, where a single misaligned element such as pacing, legibility, or notation can invalidate an otherwise correct video. We hope EduVideoBench will guide the development of VGMs that are pedagogically grounded and safe for the classroom.
Problem

Research questions and friction points this paper is trying to address.

educational validity
video generation models
benchmark
pedagogical adequacy
educational safety
Innovation

Methods, ideas, or system contributions that make the work stand out.

educational video generation
Knowledge-Skills-Attitude (KSA) framework
pedagogical evaluation
video generation models
educational validity
🔎 Similar Papers
U
Unggi Lee
Korea University Sejong Campus
H
Hoyoung Ahn
Cardiff Metropolitan University
Y
Yoon Choi
Seoul National University
S
Seonmin Eun
Seoul National University
J
Jahyun Jeong
Seoul National University
S
Seonmin Jin
Bugil Academy
H
Harmony Jung
Gyeonggi Provincial Office of Education
Hye Jin Kim
Hye Jin Kim
Assistant professor, Department of Biomedical Engineering, Yonsei University
Soft conductorBiomedical engineeringNanobiosensor
C
Chaerin Lee
Seoul National University
Hyunji Lee
Hyunji Lee
KAIST
Jeongjin Lee
Jeongjin Lee
The Ohio State University
StatisticsBiostatistics
Soohwan Lee
Soohwan Lee
Ph.D Candidate at UNIST
Human-computer InteractionHuman-centered AIConversational AIGroup Dynamics
Y
Young-Seok Oh
Korea University
J
Jaehyeon Park
Seoul National University
S
Sun-ok Ryu
Sungshin Women’s University
S
Sunyoung Shin
Seoul National University
Y
Yoorim Son
Seoul National University of Education
H
Haeun Park
Korea Institute for Curriculum and Evaluation
Yeil Jeong
Yeil Jeong
Indiana University
AI in EducationHuman-AI InteractionDomain-specific LLMs