Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems

πŸ“… 2025-02-19
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the challenge of effectively integrating user behavioral data into large language models (LLMs) for conversational recommendation systems (CRS), this paper proposes CRAGβ€”a Collaborative Retrieval-Augmented Generation framework that enables end-to-end joint modeling of LLMs and collaborative filtering (CF) for the first time. CRAG introduces a collaborative retrieval mechanism that dynamically injects CF-derived user-item interaction signals into both the retrieval and generation stages of the LLM, while jointly leveraging dialogue state and multi-source data to enhance personalized representation learning. Experiments on Reddit-v2 and ReDial demonstrate that CRAG significantly improves recommendation accuracy (e.g., +12.3% Recall@5) and item coverage (+18.7%), with particularly strong performance on long-tail items such as newly released movies. CRAG establishes a novel, interpretable, and scalable behavior-aware paradigm for LLM-driven CRS.

Technology Category

Application Category

πŸ“ Abstract
Conversational recommender systems (CRS) aim to provide personalized recommendations via interactive dialogues with users. While large language models (LLMs) enhance CRS with their superior understanding of context-aware user preferences, they typically struggle to leverage behavioral data, which have proven to be important for classical collaborative filtering (CF)-based approaches. For this reason, we propose CRAG, Collaborative Retrieval Augmented Generation for LLM-based CRS. To the best of our knowledge, CRAG is the first approach that combines state-of-the-art LLMs with CF for conversational recommendations. Our experiments on two publicly available movie conversational recommendation datasets, i.e., a refined Reddit dataset (which we name Reddit-v2) as well as the Redial dataset, demonstrate the superior item coverage and recommendation performance of CRAG, compared to several CRS baselines. Moreover, we observe that the improvements are mainly due to better recommendation accuracy on recently released movies. The code and data are available at https://github.com/yaochenzhu/CRAG.
Problem

Research questions and friction points this paper is trying to address.

Enhance LLM-based CRS with collaborative filtering
Improve recommendation accuracy for new items
Integrate behavioral data into LLM recommendations
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLMs combined with CF
CRAG enhances recommendation accuracy
Improved coverage for new items
πŸ”Ž Similar Papers
No similar papers found.