RecGPT Technical Report

📅 2025-07-30

📈 Citations: 0

✨ Influential: 0

career value

189K/year

🤖 AI Summary

Industrial recommender systems overly rely on historical co-occurrence patterns and log fitting, neglecting explicit user intent modeling—leading to interest overfitting, filter bubbles, and long-tail entrenchment. To address this, we propose RecGPT, the first intent-centric recommendation framework that unifies intent discovery, multi-granularity retrieval, and interpretable generation via large language models (LLMs). We design a two-stage training mechanism: reasoning-enhanced pre-alignment followed by self-training evolution. Furthermore, we introduce a human-in-the-loop evaluation system to guide iterative optimization. Deployed at scale in the Taobao mobile application, online A/B tests demonstrate simultaneous improvements in user satisfaction, content diversity, merchant exposure, and platform conversion rate. These results empirically validate the critical role of intent-driven recommendation in fostering sustainable ecosystem health.

Technology Category

Application Category

📝 Abstract

Recommender systems are among the most impactful applications of artificial intelligence, serving as critical infrastructure connecting users, merchants, and platforms. However, most current industrial systems remain heavily reliant on historical co-occurrence patterns and log-fitting objectives, i.e., optimizing for past user interactions without explicitly modeling user intent. This log-fitting approach often leads to overfitting to narrow historical preferences, failing to capture users' evolving and latent interests. As a result, it reinforces filter bubbles and long-tail phenomena, ultimately harming user experience and threatening the sustainability of the whole recommendation ecosystem. To address these challenges, we rethink the overall design paradigm of recommender systems and propose RecGPT, a next-generation framework that places user intent at the center of the recommendation pipeline. By integrating large language models (LLMs) into key stages of user interest mining, item retrieval, and explanation generation, RecGPT transforms log-fitting recommendation into an intent-centric process. To effectively align general-purpose LLMs to the above domain-specific recommendation tasks at scale, RecGPT incorporates a multi-stage training paradigm, which integrates reasoning-enhanced pre-alignment and self-training evolution, guided by a Human-LLM cooperative judge system. Currently, RecGPT has been fully deployed on the Taobao App. Online experiments demonstrate that RecGPT achieves consistent performance gains across stakeholders: users benefit from increased content diversity and satisfaction, merchants and the platform gain greater exposure and conversions. These comprehensive improvement results across all stakeholders validates that LLM-driven, intent-centric design can foster a more sustainable and mutually beneficial recommendation ecosystem.

Problem

Research questions and friction points this paper is trying to address.

Overcoming reliance on historical co-occurrence patterns in recommender systems

Addressing overfitting to narrow user preferences and filter bubbles

Enhancing user intent modeling for sustainable recommendation ecosystems

Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates LLMs for intent-centric recommendations

Uses multi-stage training with reasoning-enhanced alignment

Deploys Human-LLM cooperative judge system

🔎 Similar Papers

No similar papers found.