Unlocking the Power of Diffusion Models in Sequential Recommendation: A Simple and Effective Approach

πŸ“… 2025-05-26
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the pervasive embedding collapse problem in diffusion-based sequential recommendation, this paper proposes ADRecβ€”a novel framework featuring a token-level independent noise injection mechanism that enables joint diffusion training over entire sequences. ADRec integrates autoregressive modeling with token-level distribution learning and introduces a three-stage progressive training strategy to mitigate embedding collapse. During inference, it performs denoising exclusively on the final token, thereby preserving historical interaction patterns while significantly improving computational efficiency. Evaluated on six public benchmark datasets, ADRec achieves substantial improvements in recommendation accuracy, yielding average gains of +3.2% in Recall@10 and NDCG@10, while reducing inference latency by 42%. To the best of our knowledge, ADRec is the first diffusion-based sequential recommender system to achieve simultaneous optimization of both accuracy and efficiency.

Technology Category

Application Category

πŸ“ Abstract
In this paper, we focus on the often-overlooked issue of embedding collapse in existing diffusion-based sequential recommendation models and propose ADRec, an innovative framework designed to mitigate this problem. Diverging from previous diffusion-based methods, ADRec applies an independent noise process to each token and performs diffusion across the entire target sequence during training. ADRec captures token interdependency through auto-regression while modeling per-token distributions through token-level diffusion. This dual approach enables the model to effectively capture both sequence dynamics and item representations, overcoming the limitations of existing methods. To further mitigate embedding collapse, we propose a three-stage training strategy: (1) pre-training the embedding weights, (2) aligning these weights with the ADRec backbone, and (3) fine-tuning the model. During inference, ADRec applies the denoising process only to the last token, ensuring that the meaningful patterns in historical interactions are preserved. Our comprehensive empirical evaluation across six datasets underscores the effectiveness of ADRec in enhancing both the accuracy and efficiency of diffusion-based sequential recommendation systems.
Problem

Research questions and friction points this paper is trying to address.

Addressing embedding collapse in diffusion-based sequential recommendation models
Proposing ADRec to capture sequence dynamics and item representations
Enhancing accuracy and efficiency in sequential recommendation systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Independent noise process per token
Three-stage training strategy
Denoising only last token during inference
πŸ”Ž Similar Papers
No similar papers found.