DTN: Deep Multiple Task-specific Feature Interactions Network for Multi-Task Recommendation

๐Ÿ“… 2024-08-21
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing multi-task learning (MTL) recommendation modelsโ€”e.g., MMoE and PLEโ€”neglect both intra-feature interactions and inter-task variations in feature importance, thereby limiting high-order representation learning. To address this, we propose a task-specific feature interaction and sensitivity joint modeling framework. Specifically, we design a multi-path task-adaptive feature interaction module to explicitly capture heterogeneous feature importance across tasks, and introduce a feature-importance-aware gating mechanism for dynamic weight allocation. The entire model is trained end-to-end. On a large-scale e-commerce dataset containing 6.3 billion samples, our method significantly outperforms state-of-the-art MTL baselines including MMoE and PLE. Online A/B testing demonstrates improvements of +3.28% in CTR, +3.10% in order volume, and +2.70% in GMV. Furthermore, extensive experiments on public benchmarks validate its cross-domain generalization capability.

Technology Category

Application Category

๐Ÿ“ Abstract
Neural-based multi-task learning (MTL) has been successfully applied to many recommendation applications. However, these MTL models (e.g., MMoE, PLE) did not consider feature interaction during the optimization, which is crucial for capturing complex high-order features and has been widely used in ranking models for real-world recommender systems. Moreover, through feature importance analysis across various tasks in MTL, we have observed an interesting divergence phenomenon that the same feature can have significantly different importance across different tasks in MTL. To address these issues, we propose Deep Multiple Task-specific Feature Interactions Network (DTN) with a novel model structure design. DTN introduces multiple diversified task-specific feature interaction methods and task-sensitive network in MTL networks, enabling the model to learn task-specific diversified feature interaction representations, which improves the efficiency of joint representation learning in a general setup. We applied DTN to our company's real-world E-commerce recommendation dataset, which consisted of over 6.3 billion samples, the results demonstrated that DTN significantly outperformed state-of-the-art MTL models. Moreover, during online evaluation of DTN in a large-scale E-commerce recommender system, we observed a 3.28% in clicks, a 3.10% increase in orders and a 2.70% increase in GMV (Gross Merchandise Value) compared to the state-of-the-art MTL models. Finally, extensive offline experiments conducted on public benchmark datasets demonstrate that DTN can be applied to various scenarios beyond recommendations, enhancing the performance of ranking models.
Problem

Research questions and friction points this paper is trying to address.

Capturing complex high-order feature interactions in MTL
Addressing feature importance divergence across different tasks
Improving joint representation learning efficiency in recommendations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multiple task-specific feature interaction methods
Task-sensitive network in MTL
Diversified feature interaction representations
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yaowen Bi
Shopee Pte Ltd, Singapore
Y
Yuteng Lian
Shopee Pte Ltd, Singapore
J
Jie Cui
Shopee Pte Ltd, Singapore
J
Jun Liu
Shopee Pte Ltd, Singapore
P
Peijian Wang
Shopee Pte Ltd, Singapore
G
Guanghui Li
Shopee Pte Ltd, Singapore
X
Xuejun Chen
Shopee Pte Ltd, Singapore
J
Jinglin Zhao
Shopee Pte Ltd, Singapore
H
Hao Wen
Shopee Pte Ltd, Singapore
J
Jing Zhang
Shopee Pte Ltd, Singapore
Z
Zhaoqi Zhang
Shopee Pte Ltd, Singapore
Wenzhuo Song
Wenzhuo Song
Shopee Pte Ltd, Singapore
Y
Yang Sun
Shopee Pte Ltd, Singapore
W
Weiwei Zhang
Shopee Pte Ltd, Singapore
M
Mingchen Cai
Shopee Pte Ltd, Singapore
G
Guanxing Zhang
Shopee Pte Ltd, Singapore