🤖 AI Summary
Current cross-prompt automated essay scoring (AES) suffers from poor generalization to unseen prompts, primarily because existing models rely heavily on prompt-specific essay-score pairs and thus fail to learn prompt-invariant, generalizable representations. To address this, we propose Grammar-Aware Prompt-Specific trait Scoring (GAPS), the first AES framework to explicitly incorporate grammatical error correction (GEC) signals. GAPS employs a dual-encoder architecture that jointly encodes both the original essay and its GEC-corrected version, enabling syntactic-level disentanglement of prompt-dependent features and facilitating learning of prompt-independent essay representations. Under rigorous cross-prompt evaluation, GAPS achieves significant gains in weighted quadratic kappa (QWK), particularly improving accuracy on grammar- and vocabulary-related scoring dimensions. Overall, it surpasses state-of-the-art methods in cross-prompt generalization performance.
📝 Abstract
In automated essay scoring (AES), recent efforts have shifted toward cross-prompt settings that score essays on unseen prompts for practical applicability. However, prior methods trained with essay-score pairs of specific prompts pose challenges in obtaining prompt-generalized essay representation. In this work, we propose a grammar-aware cross-prompt trait scoring (GAPS), which internally captures prompt-independent syntactic aspects to learn generic essay representation. We acquire grammatical error-corrected information in essays via the grammar error correction technique and design the AES model to seamlessly integrate such information. By internally referring to both the corrected and the original essays, the model can focus on generic features during training. Empirical experiments validate our method's generalizability, showing remarkable improvements in prompt-independent and grammar-related traits. Furthermore, GAPS achieves notable QWK gains in the most challenging cross-prompt scenario, highlighting its strength in evaluating unseen prompts.