Non-autoregressive Generative Models for Reranking Recommendation

πŸ“… 2024-02-10
πŸ›οΈ Knowledge Discovery and Data Mining
πŸ“ˆ Citations: 3
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address high inference latency and weak sequence modeling in multi-stage recommender systems’ re-ranking tasks under real-time industrial settings, this paper proposes NAR4Rec, a non-autoregressive generative re-ranking model. It is the first to introduce the non-autoregressive paradigm into recommendation re-ranking. We design a sequence-level unlikelihood training objective to suppress invalid sequences, incorporate matching-aware auxiliary modeling to enhance intra-list item correlation learning, and propose a contrastive decoding mechanism to improve feasibility discrimination of candidate sequences. The model supports end-to-end joint training and efficient inference. Offline experiments demonstrate significant improvements over state-of-the-art methods. Online A/B tests show statistically significant gains in click-through rate (CTR) and user session duration. NAR4Rec has been fully deployed in the Kuaishou mobile application, serving over 300 million daily active users.

Technology Category

Application Category

πŸ“ Abstract
Contemporary recommendation systems are designed to meet users' needs by delivering tailored lists of items that align with their specific demands or interests. In a multi-stage recommendation system, reranking plays a crucial role by modeling the intra-list correlations among items. The key challenge of reranking lies in the exploration of optimal sequences within the combinatorial space of permutations. Recent research proposes a generator-evaluator learning paradigm, where the generator generates multiple feasible sequences and the evaluator picks out the best sequence based on the estimated listwise score. The generator is of vital importance, and generative models are well-suited for the generator function. Current generative models employ an autoregressive strategy for sequence generation. However, deploying autoregressive models in real-time industrial systems is challenging. To address these issues, we propose a Non-AutoRegressive generative model for reranking Recommendation (NAR4Rec) designed to enhance efficiency and effectiveness. To tackle challenges such as sparse training samples and dynamic candidates, we introduce a matching model. Considering the diverse nature of user feedback, we employ a sequence-level unlikelihood training objective to differentiate feasible sequences from unfeasible ones. Additionally, to overcome the lack of dependency modeling in non-autoregressive models regarding target items, we introduce contrastive decoding to capture correlations among these items. Extensive offline experiments validate the superior performance of NAR4Rec over state-of-the-art reranking methods. Online A/B tests reveal that NAR4Rec significantly enhances the user experience. Furthermore, NAR4Rec has been fully deployed in a popular video app Kuaishou with over 300 million daily active users.
Problem

Research questions and friction points this paper is trying to address.

Enhance reranking efficiency in recommendation systems
Address sparse training samples and dynamic candidates
Improve dependency modeling in non-autoregressive models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Non-autoregressive model for efficient reranking
Matching model handles sparse training samples
Contrastive decoding captures item correlations
Y
Yuxin Ren
Kuaishou Technology, Beijing, China
Q
Qiya Yang
Peking University, Beijing, China
Y
Yichun Wu
Tsinghua University, Beijing, China
W
Wei Xu
Kuaishou Technology, Beijing, China
Y
Yalong Wang
Kuaishou Technology, Beijing, China
Z
Zhiqiang Zhang
Kuaishou Technology, Beijing, China