Mask-guided multi-task network for face attribute recognition

πŸ“… 2026-01-04
πŸ›οΈ International Journal of Machine Learning and Cybernetics
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work proposes a mask-guided multitask network for facial attribute recognition to address the limitations of conventional approaches that rely on global features, which often introduce redundant information and suffer from negative transfer. By leveraging an adaptive masking mechanism, the method precisely localizes critical facial regions through group masks generated by integrating a pretrained landmark model with a fully convolutional network. This enables effective fusion of localized group features and global contextual information. Extensive experiments on two widely used facial attribute datasets demonstrate significant performance improvements, validating the approach’s effectiveness and robustness. The core contribution lies in mitigating negative transfer across tasks while enhancing the discriminative power of learned features.

Technology Category

Application Category

πŸ“ Abstract
Face Attribute Recognition (FAR) plays a crucial role in applications such as person re-identification, face retrieval, and face editing. Conventional multi-task attribute recognition methods often process the entire feature map for feature extraction and attribute classification, which can produce redundant features due to reliance on global regions. To address these challenges, we propose a novel approach emphasizing the selection of specific feature regions for efficient feature learning. We introduce the Mask-Guided Multi-Task Network (MGMTN), which integrates Adaptive Mask Learning (AML) and Group-Global Feature Fusion (G2FF) to address the aforementioned limitations. Leveraging a pre-trained keypoint annotation model and a fully convolutional network, AML accurately localizes critical facial parts (e.g., eye and mouth groups) and generates group masks that delineate meaningful feature regions, thereby mitigating negative transfer from global region usage. Furthermore, G2FF combines group and global features to enhance FAR learning, enabling more precise attribute identification. Extensive experiments on two challenging facial attribute recognition datasets demonstrate the effectiveness of MGMTN in improving FAR performance.
Problem

Research questions and friction points this paper is trying to address.

Face Attribute Recognition
Multi-Task Learning
Feature Redundancy
Global Region Dependency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mask-Guided Multi-Task Network
Adaptive Mask Learning
Group-Global Feature Fusion
Face Attribute Recognition
Feature Localization
πŸ”Ž Similar Papers
No similar papers found.
G
Gong Gao
School of Computer Science and Technology, Tongji University, Shanghai, 200092, Shanghai, China.
Z
Zekai Wang
School of Computer Science and Technology, Tongji University, Shanghai, 200092, Shanghai, China.
Jian Zhao
Jian Zhao
Dalian University of Technology
Mechanical sensingmultistable mechanismsinertial sensorsdynamics
Z
Ziqi Xie
Shanghai University of Engineering Science, Shanghai, 201620, Shanghai, China.
X
Xianhui Liu
School of Computer Science and Technology, Tongji University, Shanghai, 200092, Shanghai, China.
Weidong Zhao
Weidong Zhao
Shandong University
Numerical analysis