Humanoid-inspired Causal Representation Learning for Domain Generalization

📅 2025-10-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional domain generalization (DG) methods rely on statistical correlations, failing to capture causal dependencies between data and labels, and thus suffer from limited generalizability and interpretability in dynamic, complex environments. To address this, we propose the Human-like Structural Causal Model (HSCM), the first DG framework to incorporate the hierarchical processing mechanism of the human visual system into causal representation learning. HSCM achieves fine-grained disentanglement of image attributes—color, texture, and shape—and integrates dynamic causal reweighting to learn domain-invariant representations grounded in causal invariance. By unifying structural causal models (SCMs), causal representation learning, and hierarchical neural networks, HSCM theoretically guarantees causal invariance. Empirically, it achieves state-of-the-art performance across multiple standard DG benchmarks, demonstrating superior generalizability, robustness, and interpretability. The implementation is publicly available.

Technology Category

Application Category

📝 Abstract
This paper proposes the Humanoid-inspired Structural Causal Model (HSCM), a novel causal framework inspired by human intelligence, designed to overcome the limitations of conventional domain generalization models. Unlike approaches that rely on statistics to capture data-label dependencies and learn distortion-invariant representations, HSCM replicates the hierarchical processing and multi-level learning of human vision systems, focusing on modeling fine-grained causal mechanisms. By disentangling and reweighting key image attributes such as color, texture, and shape, HSCM enhances generalization across diverse domains, ensuring robust performance and interpretability. Leveraging the flexibility and adaptability of human intelligence, our approach enables more effective transfer and learning in dynamic, complex environments. Through both theoretical and empirical evaluations, we demonstrate that HSCM outperforms existing domain generalization models, providing a more principled method for capturing causal relationships and improving model robustness. The code is available at https://github.com/lambett/HSCM.
Problem

Research questions and friction points this paper is trying to address.

Proposes a human-inspired causal model for domain generalization
Disentangles image attributes like color, texture, and shape
Enhances generalization and robustness across diverse domains
Innovation

Methods, ideas, or system contributions that make the work stand out.

Humanoid-inspired causal framework for domain generalization
Disentangles and reweights image attributes like color and shape
Models hierarchical processing of human vision systems
🔎 Similar Papers
No similar papers found.
Ze Tao
Ze Tao
Changchun University of Science and Technology
Heat conductionPINNTopological insulator
J
Jian Zhang
School of Computer Science, Central South University, No.932 South Lushan Road, Changsha, 410083, Hunan, China.
H
Haowei Li
School of Computer Science, Central South University, No.932 South Lushan Road, Changsha, 410083, Hunan, China.
X
Xianshuai Li
School of Computer Science, Central South University, No.932 South Lushan Road, Changsha, 410083, Hunan, China.
Y
Yifei Peng
School of Computer Science, Central South University, No.932 South Lushan Road, Changsha, 410083, Hunan, China.
X
Xiyao Liu
School of Computer Science, Central South University, No.932 South Lushan Road, Changsha, 410083, Hunan, China.
S
Senzhang Wang
School of Computer Science, Central South University, No.932 South Lushan Road, Changsha, 410083, Hunan, China.
C
Chao Liu
School of Computer Science, Tsinghua University, No.30 Shuangqing Road, Beijing, 100084, China.
S
Sheng Ren
School of Computer and Electrical Engineering, Hunan University of Arts and Sciences, No. 3150, Dongting Road, Changde, 415000, Hunan, China.
Shichao Zhang
Shichao Zhang
Guangxi Normal University
Big DataData underlying logicKNN