🤖 AI Summary
This work addresses the limitations of existing multimodal sequential recommendation methods, which struggle to effectively model cross-modal synergistic information and rely on fixed fusion weights that lack personalization. To overcome these issues, we propose PRISM, a novel framework that, for the first time, explicitly disentangles multimodal sequences into unique, redundant, and synergistic components from an information-theoretic perspective. PRISM further introduces a plug-and-play adaptive fusion mechanism that dynamically weights and integrates multimodal signals according to individual user preferences. The framework is compatible with various backbone sequential recommendation models and demonstrates consistent and significant performance improvements across four benchmark datasets and three distinct backbone architectures, thereby validating its effectiveness and generalizability.
📝 Abstract
Multimodal sequential recommendation (MSR) leverages diverse item modalities to improve recommendation accuracy, while achieving effective and adaptive fusion remains challenging. Existing MSR models often overlook synergistic information that emerges only through modality combinations. Moreover, they typically assume a fixed importance for different modality interactions across users. To address these limitations, we propose \textbf{P}ersonalized \textbf{R}ecommend-ation via \textbf{I}nformation \textbf{S}ynergy \textbf{M}odule (PRISM), a plug-and-play framework for sequential recommendation (SR). PRISM explicitly decomposes multimodal information into unique, redundant, and synergistic components through an Interaction Expert Layer and dynamically weights them via an Adaptive Fusion Layer guided by user preferences. This information-theoretic design enables fine-grained disentanglement and personalized fusion of multimodal signals. Extensive experiments on four datasets and three SR backbones demonstrate its effectiveness and versatility. The code is available at https://github.com/YutongLi2024/PRISM.