Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems

📅 2025-01-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the limitation of conventional attention modeling in multimodal AI agents—its reliance on implicit assumptions and lack of explicit physiological grounding for inferring users’ true task intentions. We propose leveraging eye-tracking data as a primary attentional cue, integrating calibrated spatiotemporal scanpath sequences as structured contextual input into multimodal large language models’ reasoning pipeline. To our knowledge, this is the first systematic empirical validation—in realistic physical environments—that eye movement trajectories faithfully encode users’ task states and attentional foci. Experimental results demonstrate that our approach significantly improves AI agents’ real-time intent perception: task-relevant response accuracy increases by 37%. The method establishes a novel, interpretable, and generalizable paradigm for embodied intention inference, grounded in objective physiological signals rather than heuristic or latent attention mechanisms.

Technology Category

Application Category

📝 Abstract
Advanced multimodal AI agents can now collaborate with users to solve challenges in the world. We explore eye tracking's role in such interaction to convey a user's attention relative to the physical environment. We hypothesize that this knowledge improves contextual understanding for AI agents. By observing hours of human-object interactions, we first measure the relationship between an eye tracker's signal quality and its ability to reliably place gaze on nearby physical objects. We then conduct experiments which relay the user's scanpath history as additional context querying multimodal agents. Our results show that eye tracking provides high value as a user attention signal and can convey information about the user's current task and interests to the agent.
Problem

Research questions and friction points this paper is trying to address.

Artificial Intelligence
Human Intention Understanding
Gaze Direction Analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Eye Gaze Analysis
AI Intent Understanding
Performance Enhancement
Ethan Wilson
Ethan Wilson
Graduate Student, University of Florida
generative AIprivacy
N
Naveen Sendhilnathan
Meta Reality Labs Research, USA
C
C. Burlingham
Meta Reality Labs Research, USA
Y
Yusuf Mansour
Meta Reality Labs Research, USA
R
Robert Cavin
Meta Reality Labs Research, USA
S
S. Tetali
Meta Reality Labs, USA
A
Ajoy Savio Fernandes
Meta Reality Labs Research, USA
Michael J. Proulx
Michael J. Proulx
Research Scientist, Reality Labs Research; Professor of Cognition and Technology, University of Bath
Eye TrackingSmart GlassesAIAttentionMultisensory