FindRec: Stein-Guided Entropic Flow for Multi-Modal Sequential Recommendation

📅 2025-07-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of modeling temporal dynamics, heterogeneous feature distribution shifts, and severe noise interference in multimodal sequential recommendation, this paper proposes a unified information disentanglement framework. Methodologically: (1) a Stein kernel-driven ensemble coordination module aligns distributions between multimodal features and ID embeddings; (2) a cross-modal expert routing mechanism adaptively selects and fuses context-aware features; (3) a hybrid architecture integrates multi-head subspace decomposition, RBF-based Stein gradient estimation, linear-complexity Mamba structures, and an information-flow-controlled output paradigm to balance modeling efficiency and stability. Evaluated on three real-world datasets, the model significantly outperforms state-of-the-art methods—particularly under long-sequence and high-noise conditions—demonstrating superior robustness, improved recommendation accuracy, and enhanced interpretability.

Technology Category

Application Category

📝 Abstract
Modern recommendation systems face significant challenges in processing multimodal sequential data, particularly in temporal dynamics modeling and information flow coordination. Traditional approaches struggle with distribution discrepancies between heterogeneous features and noise interference in multimodal signals. We propose extbf{FindRec}~ ( extbf{F}lexible unified extbf{in}formation extbf{d}isentanglement for multi-modal sequential extbf{Rec}ommendation), introducing a novel "information flow-control-output" paradigm. The framework features two key innovations: (1) A Stein kernel-based Integrated Information Coordination Module (IICM) that theoretically guarantees distribution consistency between multimodal features and ID streams, and (2) A cross-modal expert routing mechanism that adaptively filters and combines multimodal features based on their contextual relevance. Our approach leverages multi-head subspace decomposition for routing stability and RBF-Stein gradient for unbiased distribution alignment, enhanced by linear-complexity Mamba layers for efficient temporal modeling. Extensive experiments on three real-world datasets demonstrate FindRec's superior performance over state-of-the-art baselines, particularly in handling long sequences and noisy multimodal inputs. Our framework achieves both improved recommendation accuracy and enhanced model interpretability through its modular design. The implementation code is available anonymously online for easy reproducibility~footnote{https://github.com/Applied-Machine-Learning-Lab/FindRec}.
Problem

Research questions and friction points this paper is trying to address.

Handles multimodal sequential data challenges in recommendations
Resolves distribution discrepancies in heterogeneous features
Improves accuracy and interpretability in noisy inputs
Innovation

Methods, ideas, or system contributions that make the work stand out.

Stein kernel-based distribution alignment module
Cross-modal expert routing for relevance filtering
Mamba layers for efficient temporal modeling
🔎 Similar Papers
No similar papers found.
M
Maolin Wang
City University of Hong Kong, Hong Kong SAR, China
Y
Yutian Xiao
Beihang University, Beijing, China
B
Binhao Wang
City University of Hong Kong, Hong Kong SAR, China
S
Sheng Zhang
City University of Hong Kong, Hong Kong SAR, China
S
Shanshan Ye
University of Technology Sydney, Sydney, New South Wales, Australia
W
Wanyu Wang
City University of Hong Kong, Hong Kong SAR, China
Hongzhi Yin
Hongzhi Yin
Professor and ARC Future Fellow, University of Queensland
Recommender SystemGraph LearningSpatial-temporal PredictionEdge IntelligenceLLM
Ruocheng Guo
Ruocheng Guo
Intuit AI Research
LLMsCausal MLData Mining
Zenglin Xu
Zenglin Xu
Fudan University
Machine LearningTrustworthy AIFederated LearningLarge Language ModelsTime Series Analysis