🤖 AI Summary
To address the challenges of high-dimensional decision-making, complex heterogeneous constraints (e.g., bandwidth, computational capacity, latency), and strict real-time requirements in personalized recommendation for resource-constrained multimedia scenarios—such as short-video delivery—this paper proposes the first end-to-end personalized content distribution framework based on Generative Flow Networks (GFlowNets). The framework jointly models multi-candidate generation, user behavioral preferences, and system-level resource constraints, enabling differentiable and scalable joint optimization via flow-balancing mechanisms. Compared to conventional rule-based engines and reinforcement learning approaches, our method achieves significantly improved video quality (+12.3% SSIM), resource utilization (+18.7%), and transmission cost-efficiency (−21.5%), while maintaining low inference overhead. It further demonstrates strong cross-platform generalization. The core contribution lies in pioneering the integration of GFlowNets into multimedia systems optimization, enabling the first constraint-aware, generative paradigm for personalized recommendation.
📝 Abstract
Multimedia systems underpin modern digital interactions, facilitating seamless integration and optimization of resources across diverse multimedia applications. To meet growing personalization demands, multimedia systems must efficiently manage competing resource needs, adaptive content, and user-specific data handling. This paper introduces Generative Flow Networks (GFlowNets, GFNs) as a brave new framework for enabling personalized multimedia systems. By integrating multi-candidate generative modeling with flow-based principles, GFlowNets offer a scalable and flexible solution for enhancing user-specific multimedia experiences. To illustrate the effectiveness of GFlowNets, we focus on short video feeds, a multimedia application characterized by high personalization demands and significant resource constraints, as a case study. Our proposed GFlowNet-based personalized feeds algorithm demonstrates superior performance compared to traditional rule-based and reinforcement learning methods across critical metrics, including video quality, resource utilization efficiency, and delivery cost. Moreover, we propose a unified GFlowNet-based framework generalizable to other multimedia systems, highlighting its adaptability and wide-ranging applicability. These findings underscore the potential of GFlowNets to advance personalized multimedia systems by addressing complex optimization challenges and supporting sophisticated multimedia application scenarios.