Revisiting scalable sequential recommendation with Multi-Embedding Approach and Mixture-of-Experts

📅 2025-10-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address scalability bottlenecks in sequential recommendation arising from modeling item multi-dimensional features and dynamic user-context relevance, this paper proposes Fuxi-MME—a novel framework that decouples a monolithic embedding into multiple low-dimensional, task-specific embeddings. It integrates a Mixture-of-Experts (MoE) mechanism with a lightweight Fuxi Block to enable feature-level specialized representation learning and dynamic gated fusion. Unlike conventional single-embedding + fully connected architectures, Fuxi-MME reduces the parameter growth order significantly while preserving high representational capacity, thereby improving both training and inference efficiency. Extensive experiments on multiple public sequential recommendation benchmarks demonstrate that Fuxi-MME consistently outperforms state-of-the-art baselines—including SASRec, BERT4Rec, and GRU4Rec—with substantially fewer parameters. This validates its dual advantages in recommendation accuracy and model scalability.

Technology Category

Application Category

📝 Abstract
In recommendation systems, how to effectively scale up recommendation models has been an essential research topic. While significant progress has been made in developing advanced and scalable architectures for sequential recommendation(SR) models, there are still challenges due to items' multi-faceted characteristics and dynamic item relevance in the user context. To address these issues, we propose Fuxi-MME, a framework that integrates a multi-embedding strategy with a Mixture-of-Experts (MoE) architecture. Specifically, to efficiently capture diverse item characteristics in a decoupled manner, we decompose the conventional single embedding matrix into several lower-dimensional embedding matrices. Additionally, by substituting relevant parameters in the Fuxi Block with an MoE layer, our model achieves adaptive and specialized transformation of the enriched representations. Empirical results on public datasets show that our proposed framework outperforms several competitive baselines.
Problem

Research questions and friction points this paper is trying to address.

Scaling sequential recommendation models effectively
Capturing items' multi-faceted characteristics dynamically
Addressing dynamic item relevance in user context
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-embedding strategy captures diverse item characteristics
Mixture-of-Experts architecture enables adaptive representation transformation
Decomposed embedding matrices efficiently model multifaceted item properties
🔎 Similar Papers
No similar papers found.
Q
Qiushi Pan
University of Science and Technology of China
H
Hao Wang
University of Science and Technology of China
G
Guoyuan An
Huawei Noah’s Ark Lab
Luankang Zhang
Luankang Zhang
University of Science and Technology of China
RSLLMs4Rec
W
Wei Guo
Huawei Noah’s Ark Lab
Y
Yong Liu
Huawei Noah’s Ark Lab