π€ AI Summary
Current generative AI systems struggle to reliably translate high-level creator intent into complex, controllable digital content, resulting in a significant intent-execution gap. This work proposes the Vibe AIGC paradigm, which introduces βVibeββa unified representation integrating aesthetic and functional intentβas the primary input. Leveraging a hierarchical multi-agent architecture orchestrated by a meta-planner, the framework dynamically composes executable, verifiable, and adaptive collaborative workflows. By moving beyond the limitations of monolithic black-box models, this approach shifts generative AI from stochastic output toward logically coordinated co-creation. It effectively bridges the divide between human creativity and AI execution, positioning AI as a trustworthy, system-level engineering partner capable of democratizing the creation of highly complex digital content.
π Abstract
For the past decade, the trajectory of generative artificial intelligence (AI) has been dominated by a model-centric paradigm driven by scaling laws. Despite significant leaps in visual fidelity, this approach has encountered a ``usability ceiling''manifested as the Intent-Execution Gap (i.e., the fundamental disparity between a creator's high-level intent and the stochastic, black-box nature of current single-shot models). In this paper, inspired by the Vibe Coding, we introduce the \textbf{Vibe AIGC}, a new paradigm for content generation via agentic orchestration, which represents the autonomous synthesis of hierarchical multi-agent workflows. Under this paradigm, the user's role transcends traditional prompt engineering, evolving into a Commander who provides a Vibe, a high-level representation encompassing aesthetic preferences, functional logic, and etc. A centralized Meta-Planner then functions as a system architect, deconstructing this ``Vibe''into executable, verifiable, and adaptive agentic pipelines. By transitioning from stochastic inference to logical orchestration, Vibe AIGC bridges the gap between human imagination and machine execution. We contend that this shift will redefine the human-AI collaborative economy, transforming AI from a fragile inference engine into a robust system-level engineering partner that democratizes the creation of complex, long-horizon digital assets.