🤖 AI Summary
Generative AI in K–12 STEM education often suffers from misalignment with curriculum standards, mismatched reading levels, opaque domain-specific terminology, and low student engagement. To address these challenges, this study introduces the first controllable generation framework that jointly integrates three curricular pillars—scientific concepts, crosscutting ideas, and learning objectives—with a “curiosity-driven” narrative mechanism. The framework enables coordinated control over text length, lexical difficulty, and syntactic complexity to simultaneously optimize content appropriateness, readability, and pedagogical appeal. Evaluation employs a dual-track paradigm combining LLM-as-a-judge metrics with domain-expert human assessment. Experimental results demonstrate that generated materials match or exceed human-authored baselines in grade-level alignment, scientific accuracy, and instructional utility—thereby significantly enhancing the teachability and learnability of introductory STEM content.
📝 Abstract
While Generative AI has demonstrated strong potential and versatility in content generation, its application to educational contexts presents several challenges. Models often fail to align with curriculum standards and maintain grade-appropriate reading levels consistently. Furthermore, STEM education poses additional challenges in balancing scientific explanations with everyday language when introducing complex and abstract ideas and phenomena to younger students. In this work, we propose COGENT, a curriculum-oriented framework for generating grade-appropriate educational content. We incorporate three curriculum components (science concepts, core ideas, and learning objectives), control readability through length, vocabulary, and sentence complexity, and adopt a ``wonder-based'' approach to increase student engagement and interest. We conduct a multi-dimensional evaluation via both LLM-as-a-judge and human expert analysis. Experimental results show that COGENT consistently produces grade-appropriate passages that are comparable or superior to human references. Our work establishes a viable approach for scaling adaptive and high-quality learning resources.