BaZi-Based Character Simulation Benchmark: Evaluating AI on Temporal and Persona Reasoning

๐Ÿ“… 2025-10-27
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Current virtual character generation faces bottlenecks including heavy reliance on annotated data, weak personality coherence, and insufficient cultural adaptation. To address these, we propose BaZi-LLMโ€”a novel framework that pioneers the integration of Chinese Bazi (Eight Characters) astrology as a structured personality constraint mechanism, synergizing symbolic reasoning with large language models (e.g., DeepSeek-v3, GPT-5-mini). We further introduce the first Bazi-oriented QA dataset grounded in life domainsโ€”wealth, health, family, career, and relationships. Our approach enables fine-grained, temporally dynamic virtual personality modeling, significantly enhancing character authenticity and cultural consistency. Evaluation across multiple dimensions shows accuracy improvements of 30.3%โ€“62.6% over baselines. Moreover, performance degrades by 20%โ€“45% under erroneous Bazi inputs, confirming deep cultural logic embedding and robust reasoning consistency. This work establishes a new paradigm for culture-enhanced AI character modeling.

Technology Category

Application Category

๐Ÿ“ Abstract
Human-like virtual characters are crucial for games, storytelling, and virtual reality, yet current methods rely heavily on annotated data or handcrafted persona prompts, making it difficult to scale up and generate realistic, contextually coherent personas. We create the first QA dataset for BaZi-based persona reasoning, where real human experiences categorized into wealth, health, kinship, career, and relationships are represented as life-event questions and answers. Furthermore, we propose the first BaZi-LLM system that integrates symbolic reasoning with large language models to generate temporally dynamic and fine-grained virtual personas. Compared with mainstream LLMs such as DeepSeek-v3 and GPT-5-mini, our method achieves a 30.3%-62.6% accuracy improvement. In addition, when incorrect BaZi information is used, our model's accuracy drops by 20%-45%, showing the potential of culturally grounded symbolic-LLM integration for realistic character simulation.
Problem

Research questions and friction points this paper is trying to address.

Creating benchmark for temporal persona reasoning evaluation
Developing symbolic-LLM integration for dynamic character simulation
Addressing cultural grounding in virtual persona generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates symbolic reasoning with large language models
Generates temporally dynamic and fine-grained virtual personas
Uses culturally grounded BaZi-based persona reasoning dataset
๐Ÿ”Ž Similar Papers
No similar papers found.
S
Siyuan Zheng
MirrorAI Co., Ltd.
Pai Liu
Pai Liu
University of Rochester
AI4HealthcareWeb AgentLLM
X
Xi Chen
MirrorAI Co., Ltd.
J
Jizheng Dong
New York University
S
Sihan Jia
Georgia State University