Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application

๐Ÿ“… 2024-05-07
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 3
โœจ Influential: 1
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing recommender systems overly rely on item ID embeddings while neglecting rich textual semantics, resulting in poor generalization and robustness. To address this, we propose LEARN, the first framework to integrate a frozen large language model (LLM) as a lightweight, fixed-text encoderโ€”thereby infusing open-world linguistic knowledge into collaborative filtering signals. LEARN employs a dual-tower architecture to jointly model ID-based and text-based representations, and is specifically optimized for low-latency inference in industrial settings. Evaluated on six Amazon Review benchmarks, LEARN achieves state-of-the-art performance. Moreover, on large-scale production data and online A/B tests, it delivers significant improvements: +4.2% in click-through rate (CTR) and +3.8% in conversion rate (CVR), while maintaining high computational efficiency and practical deployability.

Technology Category

Application Category

๐Ÿ“ Abstract
Contemporary recommendation systems predominantly rely on ID embedding to capture latent associations among users and items. However, this approach overlooks the wealth of semantic information embedded within textual descriptions of items, leading to suboptimal performance and poor generalizations. Leveraging the capability of large language models to comprehend and reason about textual content presents a promising avenue for advancing recommendation systems. To achieve this, we propose an Llm-driven knowlEdge Adaptive RecommeNdation (LEARN) framework that synergizes open-world knowledge with collaborative knowledge. We address computational complexity concerns by utilizing pretrained LLMs as item encoders and freezing LLM parameters to avoid catastrophic forgetting and preserve open-world knowledge. To bridge the gap between the open-world and collaborative domains, we design a twin-tower structure supervised by the recommendation task and tailored for practical industrial application. Through experiments on the real large-scale industrial dataset and online A/B tests, we demonstrate the efficacy of our approach in industry application. We also achieve state-of-the-art performance on six Amazon Review datasets to verify the superiority of our method.
Problem

Research questions and friction points this paper is trying to address.

Recommendation Systems
Product Description Information
Performance Improvement
Innovation

Methods, ideas, or system contributions that make the work stand out.

LEARN System
Pre-trained Language Models
Dual Tower Structure Optimization
๐Ÿ”Ž Similar Papers
No similar papers found.
Jian Jia
Jian Jia
Institute of Automation, Chinese Academy of Sciences (CASIA)
computer vision
Y
Yipei Wang
Southeast University, Nanjing, China
Y
Yan Li
Kuaishou Technology, Beijing, China
H
Honggang Chen
Kuaishou Technology, Beijing, China
X
Xuehan Bai
Kuaishou Technology, Beijing, China
Z
Zhaocheng Liu
Kuaishou Technology, Beijing, China
Jian Liang
Jian Liang
Kuaishou Inc.
transfer learninggraph learning
Q
Quan Chen
Kuaishou Technology, Beijing, China
H
Han Li
Kuaishou Technology, Beijing, China
P
Peng Jiang
Kuaishou Technology, Beijing, China
Kun Gai
Kun Gai
Senior Director & Researcher, Alibaba Group
Machine LearningComputational Advertising