A cross-modal network for facial expression recognition

πŸ“… 2026-05-05
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

225K/year
πŸ“ Abstract
Deep neural networks enriched with structural information have been widely employed for facial expression recognition tasks. However, these methods often depend on hierarchical information rather than face property to finish expression recognition. In this paper, we propose a cross-modal network with strong biological and structural information for facial expression recognition (CMNet). CMNet can respectively learn expression information via face symmetry on a whole face, left and right half faces to extract complementary facial features. To prevent negative effect of biological and structural information fusion, a salient facial information refinement module can obtain salient facial expression information to improve stability of an obtained facial expression classifier. To reduce reliance on unilateral facial features, a half-face alignment optimization mechanism is designed to align obtained expression information of learned left and right half faces. Our experimental results demonstrate that CMNet outperforms several novel methods, i.e., SCN and LAENet-SA for facial expression recognition. Codes can be obtained at https://github.com/hellloxiaotian/CMNet.
Problem

Research questions and friction points this paper is trying to address.

facial expression recognition
cross-modal network
face symmetry
structural information
half-face alignment
Innovation

Methods, ideas, or system contributions that make the work stand out.

cross-modal network
face symmetry
salient facial information refinement
half-face alignment
facial expression recognition
πŸ”Ž Similar Papers
No similar papers found.
C
Chunwei Tian
School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China and School of Software, Northwestern Polytechnical University, Xi’an 710072, China
J
Jingyuan Xie
School of Software, Northwestern Polytechnical University, Xi’an 710072, China
Q
Qi Zhang
School of Economics and Management, Harbin Institute of Technology at Weihai, Weihai, 264209, China
C
Chao Li
School of Computer Science and Engineering, Central South University, Changsha, China
Wangmeng Zuo
Wangmeng Zuo
School of Computer Science and Technology, Harbin Institute of Technology
Computer VisionImage ProcessingGenerative AIDeep LearningBiometrics
Shichao Zhang
Shichao Zhang
Guangxi Normal University
Big DataData underlying logicKNN