AI Summary
This work addresses two challenges in comic-style image sequence generation: inter-frame inconsistency in character identity and attire, and monotonous pose variation. We propose a retrieval-augmented, region-controlled diffusion model that integrates image retrieval, text–image alignment, region-conditioned modeling, and diffusion model fine-tuning. Key contributions include: (1) a retrieval-based character matching module that aligns textual prompts with character appearance using reference images; and (2) a region-wise character feature injection mechanism enabling localized control over facial features, clothing, and other semantic parts. On multi-frame comic generation, our approach substantially improves character consistency and pose diversity, yielding more coherent narratives and more vivid visuals.
Abstract
We present RaCig, a novel system for generating comic-style image sequences with consistent characters and expressive gestures. RaCig addresses two key challenges: (1) maintaining character identity and costume consistency across frames, and (2) producing diverse and vivid character gestures. Our approach integrates a retrieval-based character assignment module, which aligns characters in textual prompts with reference images, and a regional character injection mechanism that embeds character features into specified image regions. Experimental results demonstrate that RaCig effectively generates engaging comic narratives with coherent characters and dynamic interactions. The source code will be publicly available to support further research in this area.
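RaCig's exact injection mechanism is not detailed above, but the region-wise idea of embedding character features into specified image regions can be illustrated as masked feature blending over a spatial feature map. The function name, shapes, and blending rule below are hypothetical, a minimal sketch rather than the paper's implementation:

```python
import numpy as np

def inject_regional_features(feature_map, char_features, region_masks, alpha=0.5):
    """Blend per-character reference features into masked regions of a
    feature map (illustrative only; shapes and blending rule are assumed).

    feature_map:   np.ndarray, shape (H, W, C) -- base diffusion features
    char_features: list of np.ndarray, each shape (C,) -- one pooled
                   embedding per character (e.g. from a reference image)
    region_masks:  list of np.ndarray, each shape (H, W), values in {0, 1},
                   marking where each character should appear
    alpha:         blending strength of the injected features
    """
    out = feature_map.copy()
    for feat, mask in zip(char_features, region_masks):
        m = mask[..., None].astype(out.dtype)           # (H, W, 1)
        # Inside a character's region, mix its embedding into the features;
        # outside the mask the features are left unchanged.
        out = out * (1 - alpha * m) + alpha * m * feat  # feat broadcasts over (C,)
    return out

# Toy usage: two characters assigned to the left and right halves of an 8x8 map.
H, W, C = 8, 8, 4
fmap = np.zeros((H, W, C))
left = np.zeros((H, W)); left[:, : W // 2] = 1
right = 1 - left
out = inject_regional_features(fmap, [np.ones(C), 2 * np.ones(C)], [left, right])
```

A real system would apply such conditioning inside the denoising network (e.g. via masked cross-attention) rather than on raw feature maps, but the sketch captures the core idea of localized, per-character control.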