Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting

📅 2024-11-15

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

160K/year

🤖 AI Summary

To address prominent seam artifacts in image stitching caused by inconsistent color tones and large disparities, this paper proposes a reference-driven stitching rectification paradigm that unifies fusion and geometric correction as a reference-guided image inpainting task—thereby expanding the editable region and enhancing local adaptability. Methodologically, we introduce the first self-supervised fine-tuned text-to-image diffusion model for stitching (requiring no annotated data) and establish the first multimodal large language model (MLLM)-based metric for stitching quality assessment, jointly incorporating semantic understanding and visual fidelity. Experiments demonstrate that our approach significantly outperforms state-of-the-art methods in content consistency, transition naturalness, and zero-shot generalization, particularly under challenging scenarios. This work provides a novel, principled framework for high-fidelity image stitching.

Technology Category

Application Category

📝 Abstract

Current image stitching methods often produce noticeable seams in challenging scenarios such as uneven hue and large parallax. To tackle this problem, we propose the Reference-Driven Inpainting Stitcher (RDIStitcher), which reformulates the image fusion and rectangling as a reference-based inpainting model, incorporating a larger modification fusion area and stronger modification intensity than previous methods. Furthermore, we introduce a self-supervised model training method, which enables the implementation of RDIStitcher without requiring labeled data by fine-tuning a Text-to-Image (T2I) diffusion model. Recognizing difficulties in assessing the quality of stitched images, we present the Multimodal Large Language Models (MLLMs)-based metrics, offering a new perspective on evaluating stitched image quality. Compared to the state-of-the-art (SOTA) method, extensive experiments demonstrate that our method significantly enhances content coherence and seamless transitions in the stitched images. Especially in the zero-shot experiments, our method exhibits strong generalization capabilities. Code: https://github.com/yayoyo66/RDIStitcher

Problem

Research questions and friction points this paper is trying to address.

Addresses noticeable seams in image stitching due to uneven hue and parallax.

Proposes a reference-based inpainting model for seamless image fusion.

Introduces self-supervised training and MLLM-based metrics for quality evaluation.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Reference-Driven Inpainting for seamless stitching

Self-supervised training using Text-to-Image diffusion

MLLMs-based metrics for image quality evaluation

🔎 Similar Papers

No similar papers found.