PixelRush: Ultra-Fast, Training-Free High-Resolution Image Generation via One-step Diffusion

📅 2026-02-13

📈 Citations: 0

✨ Influential: 0

📄 PDF

career value

180K/year

Technology Category

Application Category

📝 Abstract

Pre-trained diffusion models excel at generating high-quality images but remain inherently limited by their native training resolution. Recent training-free approaches have attempted to overcome this constraint by introducing interventions during the denoising process; however, these methods incur substantial computational overhead, often requiring more than five minutes to produce a single 4K image. In this paper, we present PixelRush, the first tuning-free framework for practical high-resolution text-to-image generation. Our method builds upon the established patch-based inference paradigm but eliminates the need for multiple inversion and regeneration cycles. Instead, PixelRush enables efficient patch-based denoising within a low-step regime. To address artifacts introduced by patch blending in few-step generation, we propose a seamless blending strategy. Furthermore, we mitigate over-smoothing effects through a noise injection mechanism. PixelRush delivers exceptional efficiency, generating 4K images in approximately 20 seconds representing a 10$\times$ to 35$\times$ speedup over state-of-the-art methods while maintaining superior visual fidelity. Extensive experiments validate both the performance gains and the quality of outputs achieved by our approach.

Problem

Research questions and friction points this paper is trying to address.

high-resolution image generation

diffusion models

training-free

computational efficiency

text-to-image

Innovation

Methods, ideas, or system contributions that make the work stand out.

training-free

one-step diffusion

patch-based generation