LongRetriever: Towards Ultra-Long Sequence based Candidate Retrieval for Recommendation

📅 2025-08-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing industrial recommendation systems struggle to effectively model ultra-long user behavior sequences during the recall stage, leading to insufficient fine-grained interest representation. To address this, we propose LongRetriever—the first framework to incorporate ultra-long sequence modeling into recall. It introduces a candidate-aware contextual training paradigm to enable fine-grained interaction between behavioral sequences and candidate items; designs a multi-context retrieval mechanism to ensure consistency between training and online serving; and integrates contextual learning, candidate-specific interaction, and search-based multi-context retrieval within a scalable online optimization architecture. Deployed on a large-scale e-commerce platform, LongRetriever significantly improves online click-through rate (CTR) and conversion rate (CVR), and has been stably serving billions of users in production.

Technology Category

Application Category

📝 Abstract
Precisely modeling user ultra-long sequences is critical for industrial recommender systems. Current approaches predominantly focus on leveraging ultra-long sequences in the ranking stage, whereas research for the candidate retrieval stage remains under-explored. This paper presents LongRetriever, a practical framework for incorporating ultra-long sequences into the retrieval stage of recommenders. Specifically, we propose in-context training and multi-context retrieval, which enable candidate-specific interaction between user sequence and candidate item, and ensure training-serving consistency under the search-based paradigm. Extensive online A/B testing conducted on a large-scale e-commerce platform demonstrates statistically significant improvements, confirming the framework's effectiveness. Currently, LongRetriever has been fully deployed in the platform, impacting billions of users.
Problem

Research questions and friction points this paper is trying to address.

Modeling ultra-long user sequences for recommendation systems
Incorporating ultra-long sequences into candidate retrieval stage
Enabling candidate-specific interaction with user sequences
Innovation

Methods, ideas, or system contributions that make the work stand out.

In-context training for candidate-specific interaction
Multi-context retrieval ensuring training-serving consistency
Ultra-long sequence incorporation in retrieval stage
🔎 Similar Papers
No similar papers found.
Qin Ren
Qin Ren
Stony Brook University
Deep LearningMedical Image Analysis
Z
Zheng Chai
ByteDance, Hangzhou, China
X
Xijun Xiao
ByteDance, Beijing, China
Y
Yuchao Zheng
ByteDance, Beijing, China
D
Di Wu
ByteDance, Beijing, China