Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding

📅 2024-05-21
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the modeling challenge of micro-gestures (MGs) in identity-agnostic affective intelligence. It introduces the first systematic definition of MG-based affective semantics, departing from conventional action recognition paradigms. Methodologically, we propose a plug-and-play spatio-temporal balanced fusion module and establish a micro-pose-aware enhancement strategy synergized with large language models for affective reasoning. Our contributions are threefold: (1) We uncover, for the first time, the unique semantic value of MGs in fine-grained, unconscious affective expression; (2) Our approach achieves state-of-the-art performance on MG recognition with strong cross-dataset generalization; (3) It significantly improves the completeness and depth of affective understanding, enabling downstream applications such as deception detection.

Technology Category

Application Category

📝 Abstract
In this work, we focus on a special group of human body language -- the micro-gesture (MG), which differs from the range of ordinary illustrative gestures in that they are not intentional behaviors performed to convey information to others, but rather unintentional behaviors driven by inner feelings. This characteristic introduces two novel challenges regarding micro-gestures that are worth rethinking. The first is whether strategies designed for other action recognition are entirely applicable to micro-gestures. The second is whether micro-gestures, as supplementary data, can provide additional insights for emotional understanding. In recognizing micro-gestures, we explored various augmentation strategies that take into account the subtle spatial and brief temporal characteristics of micro-gestures, often accompanied by repetitiveness, to determine more suitable augmentation methods. Considering the significance of temporal domain information for micro-gestures, we introduce a simple and efficient plug-and-play spatiotemporal balancing fusion method. We not only studied our method on the considered micro-gesture dataset but also conducted experiments on mainstream action datasets. The results show that our approach performs well in micro-gesture recognition and on other datasets, achieving state-of-the-art performance compared to previous micro-gesture recognition methods. For emotional understanding based on micro-gestures, we construct complex emotional reasoning scenarios. Our evaluation, conducted with large language models, shows that micro-gestures play a significant and positive role in enhancing comprehensive emotional understanding. The scenarios we developed can be extended to other micro-gesture-based tasks such as deception detection and interviews. We confirm that our new insights contribute to advancing research in micro-gesture and emotional artificial intelligence.
Problem

Research questions and friction points this paper is trying to address.

Enhancing micro-gesture recognition accuracy
Improving emotional understanding via micro-gestures
Developing new strategies for micro-gesture analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Spatiotemporal balancing fusion method
Augmentation strategies for micro-gestures
Complex emotional reasoning scenarios
🔎 Similar Papers
No similar papers found.
Rong Gao
Rong Gao
Tsinghua University
Uncertainty TheoryProbability Theory
X
Xin Liu
Computer Vision and Pattern Recognition Laboratory, School of Engineering Sciences, Lappeenranta-Lahti University of Technology LUT, Finland
Bohao Xing
Bohao Xing
Lappeenranta-Lahti University of Technology LUT
Emotion AI
Zitong Yu
Zitong Yu
U.S. Food and Drug Administration
Medical imagingDeep learningMachine learningImage reconstruction
B
Björn W. Schuller
Group on Language, Audio, & Music, Imperial College London, United Kingdom; School of Medicine and Health, Technical University of Munich, Germany
H
H. Kälviäinen
Computer Vision and Pattern Recognition Laboratory, School of Engineering Sciences, Lappeenranta-Lahti University of Technology LUT, Finland