GPR: Towards a Generative Pre-trained One-Model Paradigm for Large-Scale Advertising Recommendation

📅 2025-11-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing advertising recommendation systems suffer from objective misalignment and error propagation due to multi-stage cascaded pipelines, while generative models remain insufficient for industrial deployment. This paper proposes the first end-to-end generative pre-training framework tailored for advertising recommendation, unifying user interest modeling and ad generation into a single sequence generation task. We introduce a unified input representation with multi-level semantic ID spaces, an innovative heterogeneous hierarchical dual-decoder architecture, and integrate multi-token prediction, value-aware fine-tuning, and HEPO (Hierarchical Evolutionary Policy Optimization). Deployed at scale in WeChat Channels’ advertising system, our framework significantly improves core business metrics—including GMV and CTCVR—demonstrating the first industrial-scale application of generative advertising recommendation. It validates the effectiveness and scalability of the “one-model, full-pipeline” paradigm.

Technology Category

Application Category

📝 Abstract
As an intelligent infrastructure connecting users with commercial content, advertising recommendation systems play a central role in information flow and value creation within the digital economy. However, existing multi-stage advertising recommendation systems suffer from objective misalignment and error propagation, making it difficult to achieve global optimality, while unified generative recommendation models still struggle to meet the demands of practical industrial applications. To address these issues, we propose GPR (Generative Pre-trained Recommender), the first one-model framework that redefines advertising recommendation as an end-to-end generative task, replacing the traditional cascading paradigm with a unified generative approach. To realize GPR, we introduce three key innovations spanning unified representation, network architecture, and training strategy. First, we design a unified input schema and tokenization method tailored to advertising scenarios, mapping both ads and organic content into a shared multi-level semantic ID space, thereby enhancing semantic alignment and modeling consistency across heterogeneous data. Second, we develop the Heterogeneous Hierarchical Decoder (HHD), a dual-decoder architecture that decouples user intent modeling from ad generation, achieving a balance between training efficiency and inference flexibility while maintaining strong modeling capacity. Finally, we propose a multi-stage joint training strategy that integrates Multi-Token Prediction (MTP), Value-Aware Fine-Tuning and the Hierarchy Enhanced Policy Optimization (HEPO) algorithm, forming a complete generative recommendation pipeline that unifies interest modeling, value alignment, and policy optimization. GPR has been fully deployed in the Tencent Weixin Channels advertising system, delivering significant improvements in key business metrics including GMV and CTCVR.
Problem

Research questions and friction points this paper is trying to address.

Addresses objective misalignment and error propagation in multi-stage advertising recommendation systems
Replaces traditional cascading paradigm with unified generative recommendation framework
Solves challenges in semantic alignment and modeling consistency across heterogeneous data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified generative framework replaces cascading recommendation paradigm
Dual-decoder architecture decouples user intent modeling and ad generation
Multi-stage joint training strategy integrates prediction and optimization algorithms
🔎 Similar Papers
No similar papers found.
J
Jun Zhang
Tencent Inc., China
Y
Yi Li
Tencent Inc., China
Y
Yue Liu
Tencent Inc., China
C
Changping Wang
Tencent Inc., China
Y
Yuan Wang
Tencent Inc., China
Y
Yuling Xiong
Tencent Inc., China
X
Xun Liu
Tencent Inc., China
H
Haiyang Wu
Tencent Inc., China
Q
Qian Li
Tencent Inc., China
E
Enming Zhang
Tencent Inc., China
J
Jiawei Sun
Tencent Inc., China
X
Xin Xu
Tencent Inc., China
Z
Zishuai Zhang
Tencent Inc., China
R
Ruoran Liu
Tencent Inc., China
S
Suyuan Huang
Tencent Inc., China
Z
Zhaoxin Zhang
Tencent Inc., China
Z
Zhengkai Guo
Tencent Inc., China
S
Shuojin Yang
Tsinghua University, China
Meng-Hao Guo
Meng-Hao Guo
Postdoc, Tsinghua University
Foundation ModelsReasoningAgentComputer VisionComputer Graphics
H
Huan Yu
Tencent Inc., China
J
Jie Jiang
Tencent Inc., China
Shi-Min Hu
Shi-Min Hu
Tsinghua University
Geometry processingGeometric ModelingComputer GraphicsImage and Video processingComputer Aided Design