OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework

📅 2026-03-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes a generative search framework that integrates implicit reasoning and self-distillation to address key limitations in existing systems, including inadequate comprehension of complex queries, insufficient user intent modeling, and overfitting to historical preferences. The framework features a chain-of-thought–enhanced query understanding module, a self-distillation training mechanism with internalized reasoning, and a behavior-based preference alignment strategy leveraging user feedback to achieve more accurate intent capture and result generation. Online A/B experiments demonstrate significant improvements in product click-through rate (+3.98%), buyer conversion rate (+3.05%), and order volume (+2.11%). Furthermore, the approach enhances search relevance and page-level user experience while effectively mitigating filter bubbles and long-tail sparsity issues.

Technology Category

Application Category

📝 Abstract
Generative Retrieval (GR) has emerged as a promising paradigm for modern search systems. Compared to multi-stage cascaded architecture, it offers advantages such as end-to-end joint optimization and high computational efficiency. OneSearch, as a representative industrial-scale deployed generative search framework, has brought significant commercial and operational benefits. However, its inadequate understanding of complex queries, inefficient exploitation of latent user intents, and overfitting to narrow historical preferences have limited its further performance improvement. To address these challenges, we propose \textbf{OneSearch-V2}, a latent reasoning enhanced self-distillation generative search framework. It contains three key innovations: (1) a thought-augmented complex query understanding module, which enables deep query understanding and overcomes the shallow semantic matching limitations of direct inference; (2) a reasoning-internalized self-distillation training pipeline, which uncovers users' potential yet precise e-commerce intentions beyond log-fitting through implicit in-context learning; (3) a behavior preference alignment optimization system, which mitigates reward hacking arising from the single conversion metric, and addresses personal preference via direct user feedback. Extensive offline evaluations demonstrate OneSearch-V2's strong query recognition and user profiling capabilities. Online A/B tests further validate its business effectiveness, yielding +3.98\% item CTR, +3.05\% buyer conversion rate, and +2.11\% order volume. Manual evaluation further confirms gains in search experience quality, with +1.65\% in page good rate and +1.37\% in query-item relevance. More importantly, OneSearch-V2 effectively mitigates common search system issues such as information bubbles and long-tail sparsity, without incurring additional inference costs or serving latency.
Problem

Research questions and friction points this paper is trying to address.

Generative Retrieval
complex query understanding
latent user intent
overfitting to historical preferences
information bubbles
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative Retrieval
Latent Reasoning
Self-distillation
Query Understanding
Preference Alignment
🔎 Similar Papers
No similar papers found.
Ben Chen
Ben Chen
KuaiShou, Alibaba, HUST, WHU
MultimodalLLMGenerative RecommendationSemantic Matching
S
Siyuan Wang
Kuaishou Technology, Beijing, China
Yufei Ma
Yufei Ma
Peking University
Neural Network AcceleratorComputing-in-MemoryFPGA DesignNeuromorphic Computing
Z
Zihan Liang
Kuaishou Technology, Beijing, China
X
Xuxin Zhang
Kuaishou Technology, Beijing, China
Y
Yue Lv
Kuaishou Technology, Beijing, China
Y
Ying Yang
Kuaishou Technology, Beijing, China
H
Huangyu Dai
Kuaishou Technology, Beijing, China
L
Lingtao Mao
Kuaishou Technology, Beijing, China
Tong Zhao
Tong Zhao
Head of US TikTok Mall Recommendation
Machine LearningGraph MiningRecommendationNLPDeep Learning
Z
Zhipeng Qian
Kuaishou Technology, Beijing, China
X
Xinyu Sun
Kuaishou Technology, Beijing, China
Z
Zhixin Zhai
Kuaishou Technology, Beijing, China
Y
Yang Zhao
Kuaishou Technology, Beijing, China
Bochao Liu
Bochao Liu
Unknown affiliation
J
Jingshan Lv
Kuaishou Technology, Beijing, China
X
Xiao Liang
Kuaishou Technology, Beijing, China
H
Hui Kong
Kuaishou Technology, Beijing, China
J
Jing Chen
Kuaishou Technology, Beijing, China
H
Han Li
Kuaishou Technology, Beijing, China
Chenyi Lei
Chenyi Lei
Kuaishou Technology
Recommender SystemInformation RetrievalGenerative RecommendationMultimodal
W
Wenwu Ou
Kuaishou Technology, Beijing, China
Kun Gai
Kun Gai
Senior Director & Researcher, Alibaba Group
Machine LearningComputational Advertising