AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation

📅 2026-03-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional e-commerce query suggestion methods rely on ID matching and co-click heuristics, suffering from shallow semantic understanding, poor cold-start performance, and low serendipity. This work proposes AIGQ—the first end-to-end generative query recommendation framework tailored for the HintQ scenario—introducing two key innovations: Interest-aware List-wise Supervised Fine-Tuning (IL-SFT) and List-wise Group Relative Policy Optimization (IL-GRPO). Furthermore, AIGQ employs a hybrid offline-online deployment architecture that integrates nearline generation (AIGQ-Direct) with trigger-word reasoning augmentation (AIGQ-Think). Large-scale experiments on the Taobao platform demonstrate that AIGQ significantly improves core metrics including platform efficiency and user engagement.

Technology Category

Application Category

📝 Abstract
Pre-search query recommendation, widely known as HintQ on Taobao's homepage, plays a vital role in intent capture and demand discovery, yet traditional methods suffer from shallow semantics, poor cold-start performance and low serendipity due to reliance on ID-based matching and co-click heuristics. To overcome these challenges, we propose AIGQ (AI-Generated Query architecture), the first end-to-end generative framework for HintQ scenario. AIGQ is built upon three core innovations spanning training paradigm, policy optimization and deployment architecture. First, we propose Interest-Aware List Supervised Fine-Tuning (IL-SFT), a list-level supervised learning approach that constructs training samples through session-aware behavior aggregation and interest-guided re-ranking strategy to faithfully model nuanced user intent. Accordingly, we design Interest-aware List Group Relative Policy Optimization (IL-GRPO), a novel policy gradient algorithm with a dual-component reward mechanism that jointly optimizes individual query relevance and global list properties, enhanced by a model-based reward from the online click-through rate (CTR) ranking model. To deploy under strict real-time and low-latency requirements, we further develop a hybrid offline-online architecture comprising AIGQ-Direct for nearline personalized user-to-query generation and AIGQ-Think, a reasoning-enhanced variant that produces trigger-to-query mappings to enrich interest diversity. Extensive offline evaluations and large-scale online A/B experiments on Taobao demonstrate that AIGQ consistently delivers substantial improvements in key business metrics across platform effectiveness and user engagement.
Problem

Research questions and friction points this paper is trying to address.

query recommendation
cold-start
shallow semantics
serendipity
e-commerce
Innovation

Methods, ideas, or system contributions that make the work stand out.

generative query recommendation
list-level supervised fine-tuning
policy optimization
hybrid offline-online architecture
interest-aware modeling
🔎 Similar Papers
No similar papers found.
J
Jingcao Xu
Taobao & Tmall Group of Alibaba
J
Jianyun Zou
Taobao & Tmall Group of Alibaba
R
Renkai Yang
Taobao & Tmall Group of Alibaba
Z
Zili Geng
University of Electronic Science and Technology of China
Q
Qiang Liu
Taobao & Tmall Group of Alibaba
Haihong Tang
Haihong Tang
Alibaba-inc
Information retrievalRecommendation SystemNatural language processingImage&&Video Processing and understandingData mining