🤖 AI Summary
This work addresses the limitation of existing test-time augmentation (TTA) methods in sequential recommendation, which employ uniform strategies and overlook the heterogeneity of user behaviors, thereby constraining performance. To overcome this, the authors propose AdaTTA, a reinforcement learning–based adaptive inference framework that, for the first time, formulates the selection of augmentation operators as a Markov decision process. AdaTTA leverages an Actor-Critic architecture with hybrid state representations and a joint macro-ranking reward to dynamically determine the optimal augmentation strategy for each user. The framework introduces a plug-and-play adaptive mechanism that transcends the conventional one-size-fits-all paradigm. Extensive experiments demonstrate that AdaTTA significantly outperforms the best fixed-strategy baselines across four real-world datasets and two backbone models, achieving a relative improvement of up to 26.31% on the Home dataset while maintaining manageable computational overhead.
📝 Abstract
Test-time augmentation (TTA) has become a promising approach for mitigating data sparsity in sequential recommendation by improving inference accuracy without requiring costly model retraining. However, existing TTA methods typically rely on uniform, user-agnostic augmentation strategies. We show that this "one-size-fits-all" design is inherently suboptimal, as it neglects substantial behavioral heterogeneity across users, and empirically demonstrate that the optimal augmentation operators vary significantly across user sequences with different characteristics for the first time. To address this limitation, we propose AdaTTA, a plug-and-play reinforcement learning-based adaptive inference framework that learns to select sequence-specific augmentation operators on a per-sequence basis. We formulate augmentation selection as a Markov Decision Process and introduce an Actor-Critic policy network with hybrid state representations and a joint macro-rank reward design to dynamically determine the optimal operator for each input user sequence. Extensive experiments on four real-world datasets and two recommendation backbones demonstrate that AdaTTA consistently outperforms the best fixed-strategy baselines, achieving up to 26.31% relative improvement on the Home dataset while incurring only moderate computational overhead