A Probabilistic Framework for Temporal Distribution Generalization in Industry-Scale Recommender Systems

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Recommendation systems suffer from temporal distribution shift (TDS), leading to long-term performance degradation. Existing invariant learning and self-supervised approaches exhibit unstable temporal generalization, representation collapse, and suboptimal data utilization. To address these issues, we propose the first probabilistic framework tailored for industrial-scale incremental learning, uniquely integrating statistical time-series analysis with causal graph modeling. We design a causal-structure-aware variational self-supervised objective—ELBO$_ ext{TDS}$—and introduce a time-varying factor resampling strategy to enhance temporal robustness. Deployed in Shopee’s product search system, our framework effectively mitigates representation degradation and significantly improves temporal generalization. Theoretical analysis establishes its consistency under TDS, while extensive experiments demonstrate superior temporal adaptability. Online A/B testing shows a 2.33% lift in average GMV per user, validating both practical efficacy and scalability.

Technology Category

Application Category

📝 Abstract
Temporal distribution shift (TDS) erodes the long-term accuracy of recommender systems, yet industrial practice still relies on periodic incremental training, which struggles to capture both stable and transient patterns. Existing approaches such as invariant learning and self-supervised learning offer partial solutions but often suffer from unstable temporal generalization, representation collapse, or inefficient data utilization. To address these limitations, we propose ELBO$_ ext{TDS}$, a probabilistic framework that integrates seamlessly into industry-scale incremental learning pipelines. First, we identify key shifting factors through statistical analysis of real-world production data and design a simple yet effective data augmentation strategy that resamples these time-varying factors to extend the training support. Second, to harness the benefits of this extended distribution while preventing representation collapse, we model the temporal recommendation scenario using a causal graph and derive a self-supervised variational objective, ELBO$_ ext{TDS}$, grounded in the causal structure. Extensive experiments supported by both theoretical and empirical analysis demonstrate that our method achieves superior temporal generalization, yielding a 2.33% uplift in GMV per user and has been successfully deployed in Shopee Product Search. Code is available at https://github.com/FuCongResearchSquad/ELBO4TDS.
Problem

Research questions and friction points this paper is trying to address.

Addresses temporal distribution shift degrading recommender system accuracy
Overcomes unstable generalization and representation collapse in existing methods
Enhances incremental learning for both stable and transient user patterns
Innovation

Methods, ideas, or system contributions that make the work stand out.

Probabilistic framework for temporal distribution generalization
Data augmentation resampling time-varying factors
Self-supervised variational objective from causal structure
🔎 Similar Papers
No similar papers found.
Yuxuan Zhu
Yuxuan Zhu
PhD student, University of Illinois Urbana-Champaign
Data systemsAI evaluation
Cong Fu
Cong Fu
Texas A&M University, Computer Science
Geometric Deep LearningAI for SciencePhysical SimulationsMoleculesQuantum Many-Body Physics
Y
Yabo Ni
Nanyang Technological University, Singapore, Singapore
A
Anxiang Zeng
Nanyang Technological University, Singapore, Singapore
Y
Yuan Fang
Singapore Management University, Singapore, Singapore