CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

📅 2025-06-12
📈 Citations: 0
Influential: 0
📄 PDF

career value

166K/year
🤖 AI Summary
To address the high skill barrier, poor editability, and low-quality multi-layer generation in graphic design, this paper introduces the first end-to-end framework for editable multi-layer poster generation. Methodologically: (1) we propose the first RGBA-aware large vision-language model that directly outputs structured JSON design specifications—including precise layout, layer hierarchy, styling attributes, and per-pixel RGBA channel information; (2) we introduce a foreground-background decoupled generation paradigm, integrating a conditional background synthesis model to enhance visual consistency; (3) we construct the first open-source, copyright-free, 100K-scale multi-layer design benchmark and corpus. Experiments demonstrate that our approach comprehensively outperforms state-of-the-art commercial systems (e.g., Canva Magic Design) in generation quality, editability, and controllability—enabling canvas-level editing, multilingual instruction following, responsive scaling, and dynamic poster generation.

Technology Category

Application Category

📝 Abstract
Graphic design plays a crucial role in both commercial and personal contexts, yet creating high-quality, editable, and aesthetically pleasing graphic compositions remains a time-consuming and skill-intensive task, especially for beginners. Current AI tools automate parts of the workflow, but struggle to accurately incorporate user-supplied assets, maintain editability, and achieve professional visual appeal. Commercial systems, like Canva Magic Design, rely on vast template libraries, which are impractical for replicate. In this paper, we introduce CreatiPoster, a framework that generates editable, multi-layer compositions from optional natural-language instructions or assets. A protocol model, an RGBA large multimodal model, first produces a JSON specification detailing every layer (text or asset) with precise layout, hierarchy, content and style, plus a concise background prompt. A conditional background model then synthesizes a coherent background conditioned on this rendered foreground layers. We construct a benchmark with automated metrics for graphic-design generation and show that CreatiPoster surpasses leading open-source approaches and proprietary commercial systems. To catalyze further research, we release a copyright-free corpus of 100,000 multi-layer designs. CreatiPoster supports diverse applications such as canvas editing, text overlay, responsive resizing, multilingual adaptation, and animated posters, advancing the democratization of AI-assisted graphic design. Project homepage: https://github.com/graphic-design-ai/creatiposter
Problem

Research questions and friction points this paper is trying to address.

Automating high-quality editable multi-layer graphic design generation
Enhancing user control over design assets and editability
Overcoming limitations of template-based commercial design systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generates editable multi-layer designs from instructions
Uses RGBA model for precise layer specifications
Conditional background model ensures visual coherence
🔎 Similar Papers
No similar papers found.