Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution

📅 2025-06-09

📈 Citations: 0

✨ Influential: 0

career value

194K/year

🤖 AI Summary

Arbitrary-scale image super-resolution (ArbiSR) requires modeling a continuous and broad range of scale factors, yet existing single-stage methods suffer from poor generalization, and progressive diffusion schemes remain unexplored systematically. This paper proposes CasArbi, a self-cascaded diffusion framework that decomposes large-scale upsampling into multiple small-step diffusion stages. We design a coordinate-guided residual diffusion model that jointly incorporates coordinate embeddings and continuous scale parameterization, enabling efficient and flexible scale-aware modeling and sampling. Evaluated on multiple ArbiSR benchmarks, CasArbi achieves state-of-the-art performance in both perceptual quality (LPIPS) and fidelity metrics (PSNR/SSIM), significantly outperforming existing diffusion-based and non-diffusion-based approaches.

Technology Category

Application Category

📝 Abstract

Arbitrary-scale image super-resolution aims to upsample images to any desired resolution, offering greater flexibility than traditional fixed-scale super-resolution. Recent approaches in this domain utilize regression-based or generative models, but many of them are a single-stage upsampling process, which may be challenging to learn across a wide, continuous distribution of scaling factors. Progressive upsampling strategies have shown promise in mitigating this issue, yet their integration with diffusion models for flexible upscaling remains underexplored. Here, we present CasArbi, a novel self-cascaded diffusion framework for arbitrary-scale image super-resolution. CasArbi meets the varying scaling demands by breaking them down into smaller sequential factors and progressively enhancing the image resolution at each step with seamless transitions for arbitrary scales. Our novel coordinate-guided residual diffusion model allows for the learning of continuous image representations while enabling efficient diffusion sampling. Extensive experiments demonstrate that our CasArbi outperforms prior arts in both perceptual and distortion performance metrics across diverse arbitrary-scale super-resolution benchmarks.

Problem

Research questions and friction points this paper is trying to address.

Enables arbitrary-scale image super-resolution flexibly

Addresses challenges in continuous scaling factor learning

Improves perceptual and distortion performance metrics

Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-cascaded diffusion framework for flexible upscaling

Coordinate-guided residual diffusion model

Progressive enhancement with seamless scale transitions

🔎 Similar Papers

TDDSR: Single-Step Diffusion with Two Discriminators for Super Resolution