🤖 AI Summary
In online education, precise alignment between learning resources and objectives relies on costly manual curation, hindering scalable personalized instruction. To address this, we propose the first text-embedding-based automated alignment evaluation framework, systematically benchmarking multiple models on educational alignment tasks and demonstrating that semantic similarity reliably predicts learning outcomes. Leveraging the Voyage embedding model, we achieve fine-grained semantic matching between learning objectives and resources, supported by a rigorous evidence chain comprising model evaluation, expert validation, and empirical testing. Three randomized controlled trials (N = 360) show that our framework achieves 79% alignment identification accuracy on human-curated resources and 83% on LLM-generated resources. Critically, high-alignment resources significantly improve learning performance (χ² = 15.39, p < 0.001). This work establishes a novel, low-cost, and scalable paradigm for intelligent educational resource filtering.
📝 Abstract
As the online learning landscape evolves, the need for personalization is increasingly evident. Although educational resources are burgeoning, educators face challenges selecting materials that both align with intended learning outcomes and address diverse learner needs. Large Language Models (LLMs) are attracting growing interest for their potential to create learning resources that better support personalization, but verifying coverage of intended outcomes still requires human alignment review, which is costly and limits scalability. We propose a framework that supports the cost-effective automation of evaluating alignment between educational resources and intended learning outcomes. Using human-generated materials, we benchmarked LLM-based text-embedding models and found that the most accurate model (Voyage) achieved 79% accuracy in detecting alignment. We then applied the optimal model to LLM-generated resources and, via expert evaluation, confirmed that it reliably assessed correspondence to intended outcomes (83% accuracy). Finally, in a three-group experiment with 360 learners, higher alignment scores were positively related to greater learning performance, chi-squared(2, N = 360) = 15.39, p < 0.001. These findings show that embedding-based alignment scores can facilitate scalable personalization by confirming alignment with learning outcomes, which allows teachers to focus on tailoring content to diverse learner needs.