🤖 AI Summary
Contemporary multimedia recommendation systems rely heavily on media similarity modeling, yet their embedding representations suffer from limited semantic fidelity and interpretability due to insufficient attention to representational expressiveness and interaction-level value assessment. To address this, we propose RVRec—a model-agnostic, plug-and-play embedding enhancement framework. First, it introduces a negative 2-Wasserstein distance contrastive loss to strengthen the semantic discriminability of embeddings. Second, it reweights user–media interactions via multivariate Shapley values, explicitly quantifying each interaction’s contribution to personalized recommendation. By integrating probabilistic embedding optimization with interpretability-driven contrastive learning, RVRec consistently improves both recommendation accuracy and transparency across multiple state-of-the-art base models and real-world benchmarks, outperforming all SOTA baselines.
📝 Abstract
Existing multimedia recommender systems provide users with suggestions of media by evaluating the similarities, such as games and movies. To enhance the semantics and explainability of embeddings, it is a consensus to apply additional information (e.g., interactions, contexts, popularity). However, without systematic consideration of representativeness and value, the utility and explainability of embedding drops drastically. Hence, we introduce RVRec, a plug-and-play model-agnostic embedding enhancement approach that can improve both personality and explainability of existing systems. Specifically, we propose a probability-based embedding optimization method that uses a contrastive loss based on negative 2-Wasserstein distance to learn to enhance the representativeness of the embeddings. In addtion, we introduce a reweighing method based on multivariate Shapley values strategy to evaluate and explore the value of interactions and embeddings. Extensive experiments on multiple backbone recommenders and real-world datasets show that RVRec can improve the personalization and explainability of existing recommenders, outperforming state-of-the-art baselines.