Assessing Cognitive Effort in L2 Idiomatic Processing: An Eye-Tracking Dataset

📅 2026-05-06
📈 Citations: 0
Influential: 0
📄 PDF

career value

175K/year
📝 Abstract
This paper presents the development and validation of an eye-tracking dataset designed to investigate how second-language (L2) learners process idiomatic expressions. While native speakers often rely on direct retrieval of figurative meanings, L2 speakers frequently adopt a literal-first approach, which incurs measurable cognitive costs. This resource captures these costs through ocular metrics recorded from Portuguese L1 speakers of English across all CEFR proficiency levels (A1-C2). Although the study uses entry-level 60 Hz hardware (Tobii Pro Spark), we demonstrate that this sampling rate provides sufficient data density to detect macro-cognitive events such as fixations and regressions in reading. Preliminary analysis validates the dataset by revealing a strong inverse correlation between language proficiency and regressive eye movements. Integrated into the MIA (Modeling Idiomaticity in Human and Artificial Language Processing) initiative, this dataset serves as a cognitively grounded benchmark for evaluating both human processing models and the alignment of large language models with human-like figurative understanding.
Problem

Research questions and friction points this paper is trying to address.

cognitive effort
L2 idiomatic processing
eye-tracking
figurative language
language proficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

eye-tracking dataset
L2 idiomatic processing
cognitive effort
figurative language understanding
low-cost eye-tracking
🔎 Similar Papers
No similar papers found.