A Comprehensive Survey on Retrieval Methods in Recommender Systems

📅 2024-07-11
🏛️ arXiv.org
📈 Citations: 15
Influential: 1
📄 PDF
🤖 AI Summary
Under information overload, the retrieval stage in recommender systems has long been underappreciated and lacks systematic investigation. This paper presents the first comprehensive survey of retrieval in industrial multi-stage recommendation pipelines, focusing on three core aspects: user-item similarity modeling, efficient indexing mechanisms (e.g., vector search and inverted indices), and training optimization techniques—including dual-tower architectures, contrastive learning, and negative sampling. We introduce a unified evaluation benchmark spanning three public datasets and integrate insights from leading industry practitioners to holistically characterize deployment practices, performance bottlenecks, and engineering challenges. Our work fills a critical gap in the systematic analysis of retrieval and provides both theoretical foundations and practical paradigms for designing accurate, efficient, and production-ready retrieval components within cascaded recommendation systems.

Technology Category

Application Category

📝 Abstract
In an era dominated by information overload, effective recommender systems are essential for managing the deluge of data across digital platforms. Multi-stage cascade ranking systems are widely used in the industry, with retrieval and ranking being two typical stages. Retrieval methods sift through vast candidates to filter out irrelevant items, while ranking methods prioritize these candidates to present the most relevant items to users. Unlike studies focusing on the ranking stage, this survey explores the critical yet often overlooked retrieval stage of recommender systems. To achieve precise and efficient personalized retrieval, we summarize existing work in three key areas: improving similarity computation between user and item, enhancing indexing mechanisms for efficient retrieval, and optimizing training methods of retrieval. We also provide a comprehensive set of benchmarking experiments on three public datasets. Furthermore, we highlight current industrial applications through a case study on retrieval practices at a specific company, covering the entire retrieval process and online serving, along with practical implications and challenges. By detailing the retrieval stage, which is fundamental for effective recommendation, this survey aims to bridge the existing knowledge gap and serve as a cornerstone for researchers interested in optimizing this critical component of cascade recommender systems.
Problem

Research questions and friction points this paper is trying to address.

Survey explores retrieval methods in recommender systems
Improves similarity computation between users and items
Enhances indexing mechanisms for efficient retrieval
Innovation

Methods, ideas, or system contributions that make the work stand out.

Improving user-item similarity computation methods
Enhancing indexing mechanisms for efficient retrieval
Optimizing training methods for retrieval systems
🔎 Similar Papers
No similar papers found.
J
Junjie Huang
Shanghai Jiao Tong University, China
J
Jizheng Chen
Shanghai Jiao Tong University, China
Jianghao Lin
Jianghao Lin
Shanghai Jiao Tong University
Large Language ModelsAI AgentsRecommender Systems
Jiarui Qin
Jiarui Qin
Tencent
Large Language ModelRecommender SystemsInformation Retrieval
Z
Ziming Feng
China Merchants Bank Credit Card Center, China
W
Weinan Zhang
Shanghai Jiao Tong University, China
Yong Yu
Yong Yu
Materials Engineer
Polymer matrix compositeadhesivemodelingtest development