RePainter: Empowering E-commerce Object Removal via Spatial-matting Reinforcement Learning

📅 2025-10-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address visual quality degradation in e-commerce images caused by watermarks and promotional text overlays, this paper proposes the first reinforcement learning–driven image inpainting framework tailored for commercial scenarios. Methodologically, it integrates diffusion models with a spatial-attention modulation mechanism, introduces spatial matting trajectory optimization, and employs Grouped Relative Policy Optimization (GRPO). A composite reward function is designed to jointly optimize global structure, local detail fidelity, and semantic consistency—effectively mitigating artifacts and reward hacking. To support reproducible research, we introduce EcomPaint-100K, a large-scale e-commerce image dataset, and EcomPaint-Bench, a dedicated evaluation benchmark. Extensive experiments demonstrate that our method significantly outperforms existing state-of-the-art approaches on challenging tasks involving complex compositions and persistent text/logo removal, yielding more natural and semantically coherent results. The code and models are publicly released.

Technology Category

Application Category

📝 Abstract
In web data, product images are central to boosting user engagement and advertising efficacy on e-commerce platforms, yet the intrusive elements such as watermarks and promotional text remain major obstacles to delivering clear and appealing product visuals. Although diffusion-based inpainting methods have advanced, they still face challenges in commercial settings due to unreliable object removal and limited domain-specific adaptation. To tackle these challenges, we propose Repainter, a reinforcement learning framework that integrates spatial-matting trajectory refinement with Group Relative Policy Optimization (GRPO). Our approach modulates attention mechanisms to emphasize background context, generating higher-reward samples and reducing unwanted object insertion. We also introduce a composite reward mechanism that balances global, local, and semantic constraints, effectively reducing visual artifacts and reward hacking. Additionally, we contribute EcomPaint-100K, a high-quality, large-scale e-commerce inpainting dataset, and a standardized benchmark EcomPaint-Bench for fair evaluation. Extensive experiments demonstrate that Repainter significantly outperforms state-of-the-art methods, especially in challenging scenes with intricate compositions. We will release our code and weights upon acceptance.
Problem

Research questions and friction points this paper is trying to address.

Removes intrusive watermarks and promotional text from e-commerce images
Improves unreliable object removal and domain adaptation in commercial settings
Reduces visual artifacts and unwanted object insertion in product visuals
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates spatial-matting trajectory refinement with GRPO
Modulates attention mechanisms to emphasize background context
Introduces composite reward mechanism balancing multiple constraints
🔎 Similar Papers
No similar papers found.
Z
Zipeng Guo
Sun Yat-sen University, Shenzhen, China
L
Lichen Ma
JD.COM, Beijing, China
X
Xiaolong Fu
JD.COM, Beijing, China
G
Gaojing Zhou
JD.COM, Beijing, China
Lan Yang
Lan Yang
Edwin & Florence Skinner Professor, Electrical & Systems Engineering, Washington Univ. in St Louis
resonatorlasernonlinear opticssensingnon-Hermitian physics
Y
Yuchen Zhou
Sun Yat-sen University, Shenzhen, China
L
Linkai Liu
Sun Yat-sen University, Shenzhen, China
Y
Yu He
JD.COM, Beijing, China
X
Ximan Liu
JD.COM, Beijing, China
S
Shiping Dong
Hunan University, Hunan, China
J
Jingling Fu
JD.COM, Beijing, China
Z
Zhen Chen
JD.COM, Beijing, China
Y
Yu Shi
JD.COM, Beijing, China
Junshi Huang
Junshi Huang
Meituan
Computer VisionNLPMachine Learning
J
Jason Li
JD.COM, Beijing, China
Chao Gou
Chao Gou
Associate Professor, Sun Yat-sen University
Computer visionMachine learningArtificial intelligenceParallel intelligence