Semantic-Aware Caching for Efficient Image Generation in Edge Computing

📅 2025-11-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the high latency and computational overhead of diffusion-based text-to-image generation in edge computing environments, this paper proposes a semantic-aware hybrid generative acceleration framework. The framework synergistically integrates text-only generation with image-guided generation, initializing the denoising process using semantically similar cached reference images to substantially reduce the number of denoising steps. It introduces three key innovations: (1) a semantic classification-based caching mechanism, (2) a dynamic request scheduling algorithm, and (3) a correlation-aware proactive cache maintenance strategy—collectively ensuring semantic alignment and cache efficiency. Extensive experiments on a real-world edge system demonstrate that the proposed method reduces end-to-end generation latency by 41% and computational cost by 48%, while preserving image fidelity comparable to state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
Text-to-image generation employing diffusion models has attained significant popularity due to its capability to produce high-quality images that adhere to textual prompts. However, the integration of diffusion models faces critical challenges into resource-constrained mobile and edge environments because it requires multiple denoising steps from the original random noise. A practical way to speed up denoising is to initialize the process with a noised reference image that is similar to the target, since both images share similar layouts, structures, and details, allowing for fewer denoising steps. Based on this idea, we present CacheGenius, a hybrid image generation system in edge computing that accelerates generation by combining text-toimage and image-to-image workflows. It generates images from user text prompts using cached reference images. CacheGenius introduces a semantic-aware classified storage scheme and a request-scheduling algorithm that ensures semantic alignment between references and targets. To ensure sustained performance, it employs a cache maintenance policy that proactively evicts obsolete entries via correlation analysis. Evaluated in a distributed edge computing system, CacheGenius reduces generation latency by 41% and computational costs by 48% relative to baselines, while maintaining competitive evaluation metrics.
Problem

Research questions and friction points this paper is trying to address.

Accelerates image generation in edge computing
Reduces computational costs for diffusion models
Ensures semantic alignment between cached and target images
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantic-aware caching accelerates image generation
Hybrid system combines text-to-image and image-to-image workflows
Cache maintenance policy evicts obsolete entries proactively
🔎 Similar Papers
No similar papers found.
H
Hanshuai Cui
School of Artificial Intelligence, Beijing Normal University, Beijing 100875, China, and also with Institute of Artificial Intelligence and Future Networks, Beijing Normal University, Zhuhai 519087, China
Zhiqing Tang
Zhiqing Tang
Associate Professor, Beijing Normal University
Edge ComputingEdge AI SystemsContainerReinforcement Learning
Z
Zhi Yao
School of Artificial Intelligence, Beijing Normal University, Beijing 100875, China, and also with the Institute of Artificial Intelligence and Future Networks, Beijing Normal University, Zhuhai 519087, China
W
Weijia Ji
Institute of Artificial Intelligence and Future Networks, Beijing Normal University, Zhuhai 519087, China and also with Guangdong Key Lab of AI and Multi-Modal Data Processing, Beijing Normal-Hong Kong Baptist University, Zhuhai 519087, China
W
Wei Zhao
Shenzhen University of Advanced Technology, Shenzhen 518055, China