Assessing Cognitive Effort in L2 Idiomatic Processing: An Eye-Tracking Dataset

📅 2026-05-06

📈 Citations: 0

✨ Influential: 0

career value

152K/year

📝 Abstract

This paper presents the development and validation of an eye-tracking dataset designed to investigate how second-language (L2) learners process idiomatic expressions. While native speakers often rely on direct retrieval of figurative meanings, L2 speakers frequently adopt a literal-first approach, which incurs measurable cognitive costs. This resource captures these costs through ocular metrics recorded from Portuguese L1 speakers of English across all CEFR proficiency levels (A1-C2). Although the study uses entry-level 60 Hz hardware (Tobii Pro Spark), we demonstrate that this sampling rate provides sufficient data density to detect macro-cognitive events such as fixations and regressions in reading. Preliminary analysis validates the dataset by revealing a strong inverse correlation between language proficiency and regressive eye movements. Integrated into the MIA (Modeling Idiomaticity in Human and Artificial Language Processing) initiative, this dataset serves as a cognitively grounded benchmark for evaluating both human processing models and the alignment of large language models with human-like figurative understanding.

Problem

Research questions and friction points this paper is trying to address.

cognitive effort

L2 idiomatic processing

eye-tracking

figurative language

language proficiency

Innovation

Methods, ideas, or system contributions that make the work stand out.

eye-tracking dataset

L2 idiomatic processing

cognitive effort