OneRec-Think: In-Text Reasoning for Generative Recommendation

📅 2025-10-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current generative recommendation models (e.g., OneRec) lack explicit, controllable reasoning capabilities, hindering explainable decision-making. To address this, we propose a unified generative recommendation framework integrating dialogue understanding, structured reasoning, and personalized modeling. First, we design Itemic Alignment to achieve cross-modal semantic alignment. Second, we introduce Reasoning Scaffolding—a mechanism that explicitly activates the reasoning capacity of large language models (LLMs). Third, we construct a recommendation-specific reward function tailored to multiple valid-answer preferences, coupled with Think-Ahead reasoning deployment and dialogue-context-aware personalized preference modeling. Evaluated on public benchmarks, our method achieves state-of-the-art performance. Deployed in Kuaishou’s industrial recommender system, it increases average user session duration by 0.159%. This demonstrates that explicit textual reasoning simultaneously enhances both recommendation interpretability and effectiveness.

Technology Category

Application Category

📝 Abstract
The powerful generative capacity of Large Language Models (LLMs) has instigated a paradigm shift in recommendation. However, existing generative models (e.g., OneRec) operate as implicit predictors, critically lacking the capacity for explicit and controllable reasoning-a key advantage of LLMs. To bridge this gap, we propose OneRec-Think, a unified framework that seamlessly integrates dialogue, reasoning, and personalized recommendation. OneRec-Think incorporates: (1) Itemic Alignment: cross-modal Item-Textual Alignment for semantic grounding; (2) Reasoning Activation: Reasoning Scaffolding to activate LLM reasoning within the recommendation context; and (3) Reasoning Enhancement, where we design a recommendation-specific reward function that accounts for the multi-validity nature of user preferences. Experiments across public benchmarks show state-of-the-art performance. Moreover, our proposed "Think-Ahead" architecture enables effective industrial deployment on Kuaishou, achieving a 0.159% gain in APP Stay Time and validating the practical efficacy of the model's explicit reasoning capability.
Problem

Research questions and friction points this paper is trying to address.

Enabling explicit reasoning in generative recommendation systems
Integrating dialogue and reasoning with personalized recommendations
Addressing multi-validity of user preferences through reward functions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-modal item-text alignment for semantic grounding
Reasoning scaffolding activates LLM contextual reasoning
Recommendation-specific reward function handles multi-validity preferences
🔎 Similar Papers
2024-05-17Annual Meeting of the Association for Computational LinguisticsCitations: 4
Zhanyu Liu
Zhanyu Liu
Shanghai Jiao Tong University
Recommendation SystemLarge Language ModelData MiningTime Series Analysis
S
Shiyao Wang
Kuaishou Inc., Beijing, China
X
Xingmei Wang
Kuaishou Inc., Beijing, China
R
Rongzhou Zhang
Kuaishou Inc., Beijing, China
J
Jiaxin Deng
Kuaishou Inc., Beijing, China
H
Honghui Bao
Kuaishou Inc., Beijing, China
Jinghao Zhang
Jinghao Zhang
Kuaishou Tech
Recommender SystemsMultimediaLarge Language Model
W
Wuchao Li
Kuaishou Inc., Beijing, China
Pengfei Zheng
Pengfei Zheng
Huawei Technologies
Machine Learning SystemSystem-Algorithm Co-DesignDistributed SystemData+AI
X
Xiangyu Wu
Kuaishou Inc., Beijing, China
Y
Yifei Hu
Kuaishou Inc., Beijing, China
Q
Qigen Hu
Kuaishou Inc., Beijing, China
Xinchen Luo
Xinchen Luo
kuaishou
L
Lejian Ren
Kuaishou Inc., Beijing, China
Zixing Zhang
Zixing Zhang
Professor, Hunan University
Artifical IntelligenceSpeech ProcessingAffective ComputingDigital HealthAutomatic Speech Recognition
Q
Qianqian Wang
Kuaishou Inc., Beijing, China
K
Kuo Cai
Kuaishou Inc., Beijing, China
Yunfan Wu
Yunfan Wu
Institute of Computing Technology, Chinese Academy of Sciences
Recommender SystemCollaborative FilteringAdversary Attack
H
Hongtao Cheng
Kuaishou Inc., Beijing, China
Z
Zexuan Cheng
Kuaishou Inc., Beijing, China
L
Lu Ren
Kuaishou Inc., Beijing, China
H
Huanjie Wang
Kuaishou Inc., Beijing, China
Y
Yi Su
Kuaishou Inc., Beijing, China
R
Ruiming Tang
Kuaishou Inc., Beijing, China
Kun Gai
Kun Gai
Senior Director & Researcher, Alibaba Group
Machine LearningComputational Advertising
Guorui Zhou
Guorui Zhou
Unknown affiliation
Recommender System,Advertising,Artificial Intelligence,Machine Learning,NLP