CausalVAE as a Plug-in for World Models: Towards Reliable Counterfactual Dynamics

📅 2026-04-08
🤖 AI Summary
This work addresses the unreliability of world models when predicting counterfactual dynamics under distribution shift and interventions. The authors attach CausalVAE as a plug-and-play module to diverse encoder-transition backbones. The approach preserves competitive factual prediction while learning an interpretable latent causal structure. It also improves robustness and intervention-aware counterfactual retrieval: on the Physics benchmark, CF-H@1 improves by 102.5% on average over 8 paired baselines. In one GNN-NLL configuration, CF-H@1 rises from 11.0 to 41.0, a 272.7% relative gain.
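The 272.7% figure is a relative gain, not an absolute difference in score. A minimal sketch of the arithmetic follows; the helper name `relative_gain_pct` is hypothetical and is used here only to make the calculation explicit.

```python
def relative_gain_pct(before: float, after: float) -> float:
    """Relative improvement of `after` over `before`, in percent."""
    if before == 0:
        raise ValueError("relative gain is undefined for a zero baseline")
    return (after - before) / before * 100.0


# CF-H@1 values reported for the GNN-NLL configuration on Physics.
gain = relative_gain_pct(11.0, 41.0)
print(f"{gain:.1f}%")  # 272.7%
```

The absolute improvement in this configuration is 30.0 points; dividing by the 11.0 baseline gives the reported relative gain.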
📝 Abstract
In this work, CausalVAE is introduced as a plug-in structural module for latent world models and is attached to diverse encoder-transition backbones. Across the reported benchmarks, competitive factual prediction is preserved and intervention-aware counterfactual retrieval is improved after the plug-in is added, suggesting stronger robustness under distribution shift and interventions. The largest gains are observed on the Physics benchmark: when averaged over 8 paired baselines, CF-H@1 is improved by +102.5%. In a representative GNN-NLL setting on Physics, CF-H@1 is increased from 11.0 to 41.0 (+272.7%). Through causal analysis, learned structural dependencies are shown to recover meaningful first-order physical interaction trends, supporting the interpretability of the learned latent causal structure.
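The abstract does not define CF-H@1. A common reading, assumed here, is Hits@1 computed on counterfactual (intervened) rollouts: each predicted latent state retrieves its nearest neighbour among the candidate target states, and a hit is counted when that neighbour is the matching one. The sketch below illustrates that retrieval metric under this assumption; the function names and the squared-Euclidean distance are illustrative choices, not details taken from the paper.

```python
from typing import Sequence

Vector = Sequence[float]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two latent vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def hits_at_1(predicted: Sequence[Vector], targets: Sequence[Vector]) -> float:
    """Percentage of predictions whose nearest target is the matching one.

    Assumes predicted[i] is meant to retrieve targets[i].
    """
    if len(predicted) != len(targets) or not predicted:
        raise ValueError("need equally sized, non-empty inputs")
    hits = 0
    for i, p in enumerate(predicted):
        nearest = min(range(len(targets)),
                      key=lambda j: squared_distance(p, targets[j]))
        hits += int(nearest == i)
    return 100.0 * hits / len(predicted)


# Toy example: four target states; the first two predictions are swapped.
targets = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
predicted = [[0.1, 0.9], [0.9, 0.1], [-0.9, 0.0], [0.0, -1.1]]
print(hits_at_1(predicted, targets))  # 50.0
```

Under this reading, a score of 41.0 would mean that 41% of counterfactual predictions rank the correct target first among the candidates.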
Problem

Research questions and friction points this paper is trying to address.

counterfactual dynamics
world models
causal representation
distribution shift
intervention
Innovation

Methods, ideas, or system contributions that make the work stand out.

CausalVAE
world models
counterfactual reasoning
causal representation learning
distribution shift robustness
Ziyi Ding
Tsinghua Shenzhen International Graduate School, Tsinghua University
Xianxin Lai
The University of Hong Kong
Weiyu Chen
PhD Student, Hong Kong University of Science and Technology (HKUST)
Efficient LLM, Diffusion Model, Multi-Objective Optimization
Xiao-Ping Zhang
Tsinghua Shenzhen International Graduate School, Tsinghua University
Jiayu Chen
The University of Hong Kong; INFIFORCE Intelligent Technology