🤖 AI Summary
Existing collaborative filtering methods suffer from limited capability in fine-grained preference modeling and explanation generation, failing to jointly optimize rating prediction accuracy and personalized interpretability. To address this, we propose a multi-task explainable recommendation framework that jointly learns global and aspect-level user–item representations. A personalized attention mechanism is introduced to dynamically weight the importance of different aspects according to individual user preferences. Leveraging T5-small, the model simultaneously optimizes three objectives: overall rating prediction, aspect-level rating prediction, and personalized review generation. Extensive experiments on TripAdvisor and RateBeer demonstrate that our approach significantly outperforms strong baselines—particularly in generating semantically coherent, user-specific, high-quality natural language explanations. To the best of our knowledge, this is the first work to achieve deep joint modeling of precise rating prediction and fine-grained, natural-language-based explanations.
📝 Abstract
Collaborative filtering drives many successful recommender systems but struggles with fine-grained user-item interactions and explainability. As users increasingly seek transparent recommendations, generating textual explanations through language models has become a critical research area. Existing methods employ either RNNs or Transformers. However, RNN-based approaches fail to leverage the capabilities of pre-trained Transformer models, whereas Transformer-based methods often suffer from suboptimal adaptation and neglect aspect modeling, which is crucial for personalized explanations. We propose ELIXIR (Efficient and LIghtweight model for eXplaIning Recommendations), a multi-task model combining rating prediction with personalized review generation. ELIXIR jointly learns global and aspect-specific representations of users and items, optimizing overall rating, aspect-level ratings, and review generation, with personalized attention to emphasize aspect importance. Based on a T5-small (60M) model, we demonstrate the effectiveness of our aspect-based architecture in guiding text generation in a personalized context, where state-of-the-art approaches exploit much larger models but fail to match user preferences as well. Experimental results on TripAdvisor and RateBeer demonstrate that ELIXIR significantly outperforms strong baseline models, especially in review generation.