GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion

📅 2025-06-02

📈 Citations: 0

✨ Influential: 0

career value

158K/year

🤖 AI Summary

Generative recommendation faces two key challenges: difficulty in modeling implicit item relationships and inefficient utilization of long textual information. To address these, we propose a semantic-aware multi-granularity post-fusion framework. Our approach introduces a novel “semantic → vocabulary” hierarchical relationship encoding mechanism to explicitly capture implicit inter-item associations. Additionally, we design a decoding-stage-delayed multi-granularity prompt fusion strategy, enabling hierarchical collaborative relationship modeling and late-fusion integration of heterogeneous prompts. Built upon the large language model (LLM)-based text generation paradigm, our method balances semantic richness with information completeness. Extensive experiments on four benchmark datasets demonstrate substantial improvements over eight state-of-the-art baselines: Recall@5 increases by 11.5–16.0%, and NDCG@5 improves by 5.3–13.6%.

Technology Category

Application Category

📝 Abstract

Generative recommendation is an emerging paradigm that leverages the extensive knowledge of large language models by formulating recommendations into a text-to-text generation task. However, existing studies face two key limitations in (i) incorporating implicit item relationships and (ii) utilizing rich yet lengthy item information. To address these challenges, we propose a Generative Recommender via semantic-Aware Multi-granular late fusion (GRAM), introducing two synergistic innovations. First, we design semantic-to-lexical translation to encode implicit hierarchical and collaborative item relationships into the vocabulary space of LLMs. Second, we present multi-granular late fusion to integrate rich semantics efficiently with minimal information loss. It employs separate encoders for multi-granular prompts, delaying the fusion until the decoding stage. Experiments on four benchmark datasets show that GRAM outperforms eight state-of-the-art generative recommendation models, achieving significant improvements of 11.5-16.0% in Recall@5 and 5.3-13.6% in NDCG@5. The source code is available at https://github.com/skleee/GRAM.

Problem

Research questions and friction points this paper is trying to address.

Incorporating implicit item relationships into generative recommendation models

Efficiently utilizing rich yet lengthy item information

Improving generative recommendation performance via semantic-aware fusion

Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantic-to-lexical translation for item relationships

Multi-granular late fusion for rich semantics

Separate encoders for multi-granular prompts

🔎 Similar Papers

No similar papers found.