Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User Feedback

📅 2024-06-18
🏛️ Knowledge Discovery and Data Mining
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address three key challenges in multimodal recommendation—content noise, feedback noise, and modality-behavior misalignment—this paper proposes DA-MRS, a denoising and alignment framework. DA-MRS constructs a cross-modal consistency graph to explicitly model inter-modal relationships; introduces a content-guided denoising Bayesian Personalized Ranking (BPR) loss for probabilistic feedback calibration; and pioneers a dual-alignment mechanism jointly driven by user preferences and fine-grained item relations to enhance representation consistency. The framework is plug-and-play, fully compatible with mainstream backbone models. Extensive experiments across multiple benchmark datasets and diverse noise settings demonstrate that DA-MRS consistently and significantly improves recommendation performance. Notably, it exhibits exceptional robustness under high-noise conditions, validating its strong generalizability and practical applicability in real-world noisy multimodal scenarios.

Technology Category

Application Category

📝 Abstract
Multi-modal recommender systems (MRSs) are pivotal in diverse online web platforms and have garnered considerable attention in recent years. However, previous studies overlook the challenges of (1) noisy multi-modal content, (2) noisy user feedback, and (3) aligning multi-modal content with user feedback. In order to tackle these challenges, we propose Denoising and Aligning Multi-modal Recommender System (DA-MRS). To mitigate multi-modal noise, DA-MRS first constructs item-item graphs determined by consistent content similarity across modalities. To denoise user feedback, DA-MRS associates the probability of observed feedback with multi-modal content and devises a denoised BPR loss. Furthermore, DA-MRS implements Alignment guided by User preference to enhance task-specific item representation and Alignment guided by graded Item relations to provide finer-grained alignment. Extensive experiments verify that DA-MRS is a plug-and-play framework and achieves significant and consistent improvements across various datasets, backbone models, and noisy scenarios.
Problem

Research questions and friction points this paper is trying to address.

Addressing noisy multi-modal content in recommender systems
Reducing noise in user feedback for better recommendations
Aligning multi-modal content with user feedback effectively
Innovation

Methods, ideas, or system contributions that make the work stand out.

Constructs item-item graphs for multi-modal noise reduction
Devises denoised BPR loss for user feedback
Implements user and item-guided alignment strategies
🔎 Similar Papers
No similar papers found.
G
Guipeng Xv
School of Informatics, Xiamen University, Xiamen, China
X
Xinyu Li
School of Informatics, Xiamen University, Xiamen, China
Ruobing Xie
Ruobing Xie
Tencent
Large Language ModelRecommender SystemNatural Language Processing
C
Chen Lin
School of Informatics, Xiamen University, Xiamen, China
C
Chong Liu
Tencent, Beijing, China
Feng Xia
Feng Xia
Professor, School of Computing Technologies, RMIT University
Artificial IntelligenceGraph LearningBrainRoboticsCyber-Physical Systems
Z
Zhanhui Kang
Tencent, Beijing, China
L
Leyu Lin
Tencent, Beijing, China