MARCO: A Cooperative Knowledge Transfer Framework for Personalized Cross-domain Recommendations

📅 2025-10-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address negative transfer in cross-domain recommendation—caused by cold-start and data sparsity issues stemming from imbalanced source-domain contributions and distributional discrepancies—this paper proposes a Multi-Agent Reinforcement Learning-based Knowledge Transfer framework (MAKT). MAKT models each source domain as an independent agent and employs a cooperative credit assignment mechanism to dynamically optimize domain-specific contribution weights. It further introduces an entropy-driven action diversity penalty to enhance policy expressiveness and training stability, and integrates cross-domain embedding alignment to ensure effective knowledge transfer. Extensive experiments on four benchmark datasets demonstrate that MAKT significantly outperforms existing single-agent and conventional cross-domain recommendation methods, achieving superior generalization and robustness under varying data conditions. The implementation code is publicly available.

Technology Category

Application Category

📝 Abstract
Recommender systems frequently encounter data sparsity issues, particularly when addressing cold-start scenarios involving new users or items. Multi-source cross-domain recommendation (CDR) addresses these challenges by transferring valuable knowledge from multiple source domains to enhance recommendations in a target domain. However, existing reinforcement learning (RL)-based CDR methods typically rely on a single-agent framework, leading to negative transfer issues caused by inconsistent domain contributions and inherent distributional discrepancies among source domains. To overcome these limitations, MARCO, a Multi-Agent Reinforcement Learning-based Cross-Domain recommendation framework, is proposed. It leverages cooperative multi-agent reinforcement learning, where each agent is dedicated to estimating the contribution from an individual source domain, effectively managing credit assignment and mitigating negative transfer. In addition, an entropy-based action diversity penalty is introduced to enhance policy expressiveness and stabilize training by encouraging diverse agents' joint actions. Extensive experiments across four benchmark datasets demonstrate MARCO's superior performance over state-of-the-art methods, highlighting its robustness and strong generalization capabilities. The code is at https://github.com/xiewilliams/MARCO.
Problem

Research questions and friction points this paper is trying to address.

Addressing data sparsity in cold-start recommender systems
Mitigating negative transfer in cross-domain recommendation
Managing inconsistent domain contributions via multi-agent learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-agent reinforcement learning for cross-domain recommendation
Entropy-based action diversity penalty stabilizes training
Individual agents estimate contributions from source domains
🔎 Similar Papers
No similar papers found.
L
Lili Xie
Independent Researcher, Shen Zhen, China
Y
Yi Zhang
The University of Queensland, CSIRO DATA61, Brisbane, Australia
Ruihong Qiu
Ruihong Qiu
ARC DECRA Fellow, Lecturer (Assistant Professor) @The University of Queensland
GraphLarge Language Models
J
Jiajun Liu
CSIRO DATA61, The University of Queensland, Brisbane, Australia
S
Sen Wang
The University of Queensland, Brisbane, Australia