🤖 AI Summary
This work addresses the challenge users face in maintaining narrative coherence and visual consistency when editing localized elements in human-AI collaborative digital storytelling. To this end, the authors propose a novel “Creative Decomposition and Linking” paradigm that explicitly decomposes a story into structured units—such as plot, characters, and scenes—and leverages generative AI to enable independent yet controllable generation of each unit through structured prompts and cross-modal association modeling, all while enforcing global coherence constraints. The resulting StoryComposerAI system significantly enhances users’ fine-grained control over the creative process while preserving narrative continuity and multimodal consistency, thereby demonstrating the effectiveness and innovation of the proposed paradigm.
📝 Abstract
GenAI's ability to produce text and images is increasingly incorporated into human-AI co-creation tasks such as storytelling and video editing. However, integrating GenAI into these tasks requires enabling users to retain control over editing individual story elements while ensuring that generated visuals remain coherent with the storyline and consistent across multiple AI-generated outputs. This work examines a paradigm of creative decomposition and linking, which allows creators to clearly communicate creative intent by prompting GenAI to tailor specific story elements, such as storylines, personas, locations, and scenes, while maintaining coherence among them. We implement and evaluate StoryComposerAI, a system that exemplifies this paradigm for enhancing users' sense of control and content consistency in human-AI co-creation of digital stories.