EGRA: Toward Enhanced Behavior Graphs and Representation Alignment for Multimodal Recommendation

📅 2025-08-22
📈 Citations: 0
Influential: 0
🤖 AI Summary
Existing multimodal recommendation systems build item-item links for the behavior graph directly from raw modality features, which weakens the balance between collaborative and modality semantics and leaves the graph vulnerable to modality noise. They also apply a single, fixed modality-behavior alignment weight to all entities throughout training, which limits how well the two representations can be aligned. To address these issues, we propose EGRA. First, it uses representations from a pretrained multimodal recommendation model to construct an item-item graph that is added to the behavior graph, so the graph captures both collaborative and modality-aware similarity while suppressing modality noise. Second, it introduces a bi-level dynamic alignment mechanism: (i) entity-level adaptive weighting based on each entity's alignment degree, and (ii) a gradual increase of the overall alignment strength over the course of training. Experiments on five datasets show that EGRA significantly outperforms recent methods.

📝 Abstract
MultiModal Recommendation (MMR) systems have emerged as a promising solution for improving recommendation quality by leveraging rich item-side modality information, prompting a surge of diverse methods. Despite these advances, existing methods still face two critical limitations. First, they use raw modality features to construct item-item links for enriching the behavior graph, while giving limited attention to balancing collaborative and modality-aware semantics or mitigating modality noise in the process. Second, they use a uniform alignment weight across all entities and also maintain a fixed alignment strength throughout training, limiting the effectiveness of modality-behavior alignment. To address these challenges, we propose EGRA. First, instead of relying on raw modality features, it alleviates sparsity by incorporating into the behavior graph an item-item graph built from representations generated by a pretrained MMR model. This enables the graph to capture both collaborative patterns and modality-aware similarities with enhanced robustness against modality noise. Moreover, it introduces a novel bi-level dynamic alignment weighting mechanism to improve modality-behavior representation alignment, which dynamically assigns alignment strength across entities according to their alignment degree, while gradually increasing the overall alignment intensity throughout training. Extensive experiments on five datasets show that EGRA significantly outperforms recent methods, confirming its effectiveness.
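To make the first idea concrete, here is a minimal sketch of building item-item links from item representations. The abstract does not say how EGRA selects neighbors, so the top-k cosine-similarity rule, the function names `cosine` and `knn_item_graph`, and the toy vectors are all assumptions for illustration; in practice the vectors would come from the pretrained MMR model rather than from raw modality features.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all-zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0


def knn_item_graph(item_emb, k):
    """Map each item index to the indices of its k most similar other items.

    Assumption: top-k cosine similarity; the paper's actual rule may differ.
    """
    n = len(item_emb)
    graph = {}
    for i in range(n):
        scored = [(cosine(item_emb[i], item_emb[j]), j) for j in range(n) if j != i]
        scored.sort(key=lambda s: (-s[0], s[1]))  # most similar first, ties by index
        graph[i] = [j for _, j in scored[:k]]
    return graph


# Toy vectors standing in for item representations from a pretrained model.
item_emb = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
graph = knn_item_graph(item_emb, k=1)
```

With these toy vectors, items 0 and 1 link to each other, as do items 2 and 3. The quadratic loop is only suitable for small examples; a real catalogue would need batched matrix operations or approximate nearest-neighbor search.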
Problem

Research questions and friction points this paper is trying to address.

Enhancing behavior graphs with collaborative and modality-aware semantics
Mitigating modality noise in multimodal recommendation systems
Improving modality-behavior alignment with a dynamic weighting mechanism
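As a rough illustration of what enhancing the behavior graph can look like, the sketch below merges user-item interactions with a set of item-item links into one graph and applies the symmetric degree normalisation commonly used in graph-based recommenders. The function name `build_enhanced_graph`, the node labelling, and the normalisation are assumptions; the abstract only states that an item-item graph is incorporated into the behavior graph.

```python
import math


def build_enhanced_graph(interactions, item_links):
    """Return {(a, b): weight} over the union of user-item and item-item edges.

    Nodes are labelled ("user", id) or ("item", id). Each edge is stored in both
    directions with weight 1 / sqrt(deg(a) * deg(b)) (an assumed normalisation).
    """
    neighbors = {}

    def add_edge(a, b):
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)

    for user, item in interactions:
        add_edge(("user", user), ("item", item))
    for i, j in item_links:
        add_edge(("item", i), ("item", j))

    return {
        (a, b): 1.0 / math.sqrt(len(neighbors[a]) * len(neighbors[b]))
        for a in neighbors
        for b in neighbors[a]
    }


# Hypothetical toy data: three interactions plus one item-item link.
weights = build_enhanced_graph(
    interactions=[(0, 0), (0, 1), (1, 2)],
    item_links=[(1, 2)],
)
```

Here item 2, which has only one interaction, gains a second neighbor through the item-item link, which is the sense in which such links can ease sparsity.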
Innovation

Methods, ideas, or system contributions that make the work stand out.

Enhanced behavior graph with pretrained representations
Bi-level dynamic alignment weighting mechanism
Dynamic entity-specific alignment strength assignment
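The bi-level weighting can be sketched as two multiplicative factors on an alignment loss: a per-entity weight and a global strength that grows with the training step. The details below are assumptions, since the abstract gives no formulas: alignment degree is measured by cosine similarity, less-aligned entities receive larger weights through a softmax, and the global strength follows a linear ramp. The paper may use a different measure, the opposite weighting direction, or another schedule.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all-zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0


def entity_weights(behavior, modality, temperature=1.0):
    """Entity level: softmax over misalignment (1 - cosine), rescaled to mean 1.

    Assumption: entities that are less aligned get a larger weight.
    """
    gaps = [1.0 - cosine(b, m) for b, m in zip(behavior, modality)]
    exps = [math.exp(g / temperature) for g in gaps]
    total = sum(exps)
    return [len(gaps) * e / total for e in exps], gaps


def global_strength(step, total_steps, max_strength=1.0):
    """Global level: linear ramp from 0 to max_strength (an assumed schedule)."""
    return max_strength * min(1.0, step / total_steps)


def alignment_loss(behavior, modality, step, total_steps):
    """Weighted mean misalignment, scaled by the step-dependent global strength."""
    weights, gaps = entity_weights(behavior, modality)
    lam = global_strength(step, total_steps)
    return lam * sum(w * g for w, g in zip(weights, gaps)) / len(gaps)


# Toy example: entity 0 is perfectly aligned, entity 1 is orthogonal.
behavior = [[1.0, 0.0], [1.0, 0.0]]
modality = [[1.0, 0.0], [0.0, 1.0]]
w, _ = entity_weights(behavior, modality)
early = alignment_loss(behavior, modality, step=50, total_steps=100)
late = alignment_loss(behavior, modality, step=100, total_steps=100)
```

In this toy case the orthogonal entity receives the larger weight, and the loss at the final step is twice the loss at the halfway point. In an autograd framework the entity weights would typically be treated as constants so that gradients flow only through the alignment term, though that too is a design choice rather than something the abstract states.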
Authors

Xiaoxiong Zhang
College of Computing and Data Science, Nanyang Technological University, Singapore

Xin Zhou
College of Computing and Data Science, Nanyang Technological University, Singapore

Zhiwei Zeng
Nanyang Technological University
Explainable AI, AI for Healthcare, AI for Gerontology

Yongjie Wang
Nanyang Technological University
Explainable AI, Interpretability, Machine Learning, Trustworthy AI

Dusit Niyato
College of Computing and Data Science, Nanyang Technological University, Singapore

Zhiqi Shen
Nanyang Technological University
Goal Modeling, Software Agents, Intelligent Agents, Health Games, Educational Games