UniSearch: Rethinking Search System with a Unified Generative Architecture

📅 2025-09-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional search cascade architectures suffer from modular fragmentation and misaligned optimization objectives, hindering end-to-end optimization and generalization. This paper introduces UniSearch—a truly end-to-end generative search framework tailored for Kuaishou’s short-video and live-streaming scenarios—where a unified generator directly models the “query → content” mapping. Our key contributions are: (1) a jointly optimized generator-video encoder architecture; (2) Search Preference Optimization (SPO), which integrates user behavioral feedback and reward modeling to align generated outputs with user preferences; and (3) a unified training paradigm combining latent item embeddings, tokenized representations, and behavior-driven reinforcement learning. Online A/B testing demonstrates that UniSearch achieves the largest single-experiment gain in live-stream search to date, validating its effectiveness and scalability for large-scale deployment.

Technology Category

Application Category

📝 Abstract
Modern search systems play a crucial role in facilitating information acquisition. Traditional search engines typically rely on a cascaded architecture, where results are retrieved through recall, pre-ranking, and ranking stages. The complexity of designing and maintaining multiple modules makes it difficult to achieve holistic performance gains. Recent advances in generative recommendation have motivated the exploration of unified generative search as an alternative. However, existing approaches are not genuinely end-to-end: they typically train an item encoder to tokenize candidates first and then optimize a generator separately, leading to objective inconsistency and limited generalization. To address these limitations, we propose UniSearch, a unified generative search framework for Kuaishou Search. UniSearch replaces the cascaded pipeline with an end-to-end architecture that integrates a Search Generator and a Video Encoder. The Generator produces semantic identifiers of relevant items given a user query, while the Video Encoder learns latent item embeddings and provides their tokenized representations. A unified training framework jointly optimizes both components, enabling mutual enhancement and improving representation quality and generation accuracy. Furthermore, we introduce Search Preference Optimization (SPO), which leverages a reward model and real user feedback to better align generation with user preferences. Extensive experiments on industrial-scale datasets, together with online A/B testing in both short-video and live search scenarios, demonstrate the strong effectiveness and deployment potential of UniSearch. Notably, its deployment in live search yields the largest single-experiment improvement in recent years of our product's history, highlighting its practical value for real-world applications.
Problem

Research questions and friction points this paper is trying to address.

Unified generative search to replace cascaded architecture
Address objective inconsistency in existing generative approaches
Improve generalization and representation quality in search systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified end-to-end generative search architecture
Joint optimization of generator and video encoder
Search Preference Optimization with user feedback alignment
🔎 Similar Papers
No similar papers found.
J
Jiahui Chen
Kuaishou Technology, Beijing, China
X
Xiaoze Jiang
Kuaishou Technology, Beijing, China
Z
Zhibo Wang
Kuaishou Technology, Beijing, China
Q
Quanzhi Zhu
Kuaishou Technology, Beijing, China
Junyao Zhao
Junyao Zhao
IRIF (CNRS & Université Paris Cité)
algorithmic game theorycombinatorial optimizationmachine learning
F
Feng Hu
Kuaishou Technology, Beijing, China
K
Kang Pan
Kuaishou Technology, Beijing, China
A
Ao Xie
Kuaishou Technology, Beijing, China
M
Maohua Pei
Kuaishou Technology, Beijing, China
Z
Zhiheng Qin
Kuaishou Technology, Beijing, China
H
Hongjing Zhang
Kuaishou Technology, Beijing, China
Z
Zhixin Zhai
Kuaishou Technology, Beijing, China
Xiaobo Guo
Xiaobo Guo
Dartmouth College
machine learningdeep learningnatural language processingsocia mediapropagantion
R
Runbin Zhou
Kuaishou Technology, Beijing, China
K
Kefeng Wang
Kuaishou Technology, Beijing, China
Mingyang Geng
Mingyang Geng
National University of Defense Technology
deep learningsoftware engineering
C
Cheng Chen
Kuaishou Technology, Beijing, China
J
Jingshan Lv
Kuaishou Technology, Beijing, China
Y
Yupeng Huang
Kuaishou Technology, Beijing, China
X
Xiao Liang
Kuaishou Technology, Beijing, China
H
Han Li
Kuaishou Technology, Beijing, China