REVELIO -- Universal Multimodal Task Load Estimation for Cross-Domain Generalization

📅 2025-09-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing cognitive load estimation models suffer from poor generalizability, with insufficient evaluation of robustness and transferability in real-world settings. To address this, we propose the first cross-domain transferable cognitive load assessment framework. We introduce a novel, ecologically valid multimodal dataset integrating n-back tasks and game-based applications, annotated along three dimensions: objective behavioral performance, subjective self-reports, and task design parameters. Leveraging end-to-end architectures—including xLSTM, ConvNeXt, and Transformer—we jointly model heterogeneous physiological and behavioral signals. A systematic cross-domain evaluation reveals that multimodal approaches significantly outperform unimodal baselines; however, non-negligible performance degradation persists under domain shifts, exposing fundamental challenges in generalizable cognitive load estimation. This work establishes a reproducible, extensible cross-domain benchmark, providing both a new paradigm for transferable cognitive modeling and a strong, publicly available baseline.

Technology Category

Application Category

📝 Abstract
Task load detection is essential for optimizing human performance across diverse applications, yet current models often lack generalizability beyond narrow experimental domains. While prior research has focused on individual tasks and limited modalities, there remains a gap in evaluating model robustness and transferability in real-world scenarios. This paper addresses these limitations by introducing a new multimodal dataset that extends established cognitive load detection benchmarks with a real-world gaming application, using the $n$-back test as a scientific foundation. Task load annotations are derived from objective performance, subjective NASA-TLX ratings, and task-level design, enabling a comprehensive evaluation framework. State-of-the-art end-to-end model, including xLSTM, ConvNeXt, and Transformer architectures are systematically trained and evaluated on multiple modalities and application domains to assess their predictive performance and cross-domain generalization. Results demonstrate that multimodal approaches consistently outperform unimodal baselines, with specific modalities and model architectures showing varying impact depending on the application subset. Importantly, models trained on one domain exhibit reduced performance when transferred to novel applications, underscoring remaining challenges for universal cognitive load estimation. These findings provide robust baselines and actionable insights for developing more generalizable cognitive load detection systems, advancing both research and practical implementation in human-computer interaction and adaptive systems.
Problem

Research questions and friction points this paper is trying to address.

Develops universal task load estimation model for cross-domain generalization
Evaluates multimodal model robustness in real-world gaming scenarios
Addresses performance degradation when transferring models to novel applications
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal dataset with gaming application
xLSTM ConvNeXt Transformer architectures evaluated
Cross-domain generalization performance assessed
M
Maximilian P. Oppelt
Department Digital Health and Analytics, Fraunhofer IIS, Fraunhofer Institute for Integrated Circuits IIS, 91058 Erlangen, Germany and Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-University Erlangen Nuremberg, 91052 Erlangen, Germany
A
Andreas Foltyn
Department Digital Health and Analytics, Fraunhofer IIS, Fraunhofer Institute for Integrated Circuits IIS, 91058 Erlangen, Germany
N
Nadine R. Lang-Richter
Department Digital Health and Analytics, Fraunhofer IIS, Fraunhofer Institute for Integrated Circuits IIS, 91058 Erlangen, Germany
Bjoern M. Eskofier
Bjoern M. Eskofier
MaD Lab, FAU Erlangen-Nürnberg & TDH Group, Helmholtz Munich
Machine LearningArtificial IntelligenceWearable ComputingDigital HealthBiomedical Eng