From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the single-image foreground-background layer decomposition problem. The proposed method leverages a lightweight diffusion-based layer disentanglement framework. Methodologically, it (1) adapts a pre-trained generative inpainting model via light-weight fine-tuning for layer separation; (2) introduces a multimodal context fusion module that models cross-layer semantic dependencies in latent space using linear-complexity attention, thereby preserving fine-grained details; and (3) constructs a high-quality synthetic dataset enabling end-to-end training. Experimental results demonstrate substantial improvements over state-of-the-art approaches on object removal and occlusion recovery tasks. The method yields high-fidelity, editable layered representations, facilitating downstream applications such as image editing and content creation.

Technology Category

Application Category

📝 Abstract
Images can be viewed as layered compositions, foreground objects over background, with potential occlusions. This layered representation enables independent editing of elements, offering greater flexibility for content creation. Despite the progress in large generative models, decomposing a single image into layers remains challenging due to limited methods and data. We observe a strong connection between layer decomposition and in/outpainting tasks, and propose adapting a diffusion-based inpainting model for layer decomposition using lightweight finetuning. To further preserve detail in the latent space, we introduce a novel multi-modal context fusion module with linear attention complexity. Our model is trained purely on a synthetic dataset constructed from open-source assets and achieves superior performance in object removal and occlusion recovery, unlocking new possibilities in downstream editing and creative applications.
Problem

Research questions and friction points this paper is trying to address.

Adapting diffusion inpainting models for image layer decomposition
Solving single-image layer decomposition with limited methods/data
Enabling independent element editing through layered representation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Adapting diffusion inpainting model for layer decomposition
Introducing multi-modal context fusion with linear attention
Training on synthetic dataset for object removal tasks
🔎 Similar Papers
No similar papers found.