GameFactory: Creating New Games with Generative Interactive Videos

πŸ“… 2025-01-14
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Current generative game engines exhibit significant limitations in cross-scene generalization and precise action control, particularly when adapting to fixed-style games. To address this, we propose the first scene-generalization-driven framework for game video generation, built upon an open-source video diffusion model. Our method introduces a multi-stage fine-tuning strategy that decouples style learning from motion control, augmented by a novel motion-conditioning injection mechanism enabling arbitrarily long, action-controllable autoregressive video generation. We train and evaluate the framework on GF-Minecraftβ€”a large-scale, real-world Minecraft dataset with fine-grained human action annotations. Experiments demonstrate substantial improvements in open-domain game video diversity, action fidelity, and cross-scene generalization capability. The code and dataset are publicly released, establishing a new paradigm for AI-driven game content generation.

Technology Category

Application Category

πŸ“ Abstract
Generative game engines have the potential to revolutionize game development by autonomously creating new content and reducing manual workload. However, existing video-based game generation methods fail to address the critical challenge of scene generalization, limiting their applicability to existing games with fixed styles and scenes. In this paper, we present GameFactory, a framework focused on exploring scene generalization in game video generation. To enable the creation of entirely new and diverse games, we leverage pre-trained video diffusion models trained on open-domain video data. To bridge the domain gap between open-domain priors and small-scale game dataset, we propose a multi-phase training strategy that decouples game style learning from action control, preserving open-domain generalization while achieving action controllability. Using Minecraft as our data source, we release GF-Minecraft, a high-quality and diversity action-annotated video dataset for research. Furthermore, we extend our framework to enable autoregressive action-controllable game video generation, allowing the production of unlimited-length interactive game videos. Experimental results demonstrate that GameFactory effectively generates open-domain, diverse, and action-controllable game videos, representing a significant step forward in AI-driven game generation. Our dataset and project page are publicly available at url{https://vvictoryuki.github.io/gamefactory/}.
Problem

Research questions and friction points this paper is trying to address.

Generative Game Engines
Video-to-Game Adaptation
Multi-scene Compatibility
Innovation

Methods, ideas, or system contributions that make the work stand out.

GameFactory
Generative Game Engine
AI Game Creation
πŸ”Ž Similar Papers