Pctx: Tokenizing Personalized Context for Generative Recommendation

πŸ“… 2025-10-24
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing generative recommendation models employ static, non-personalized tokenization strategies that assign uniform semantic IDs to items solely based on item features, ignoring user-specific intent and thus failing to produce contextually differentiated item representations across users. Method: We propose the first personalized, context-aware tokenization method for generative recommendation. It dynamically constructs a learnable tokenizer from each user’s historical interaction sequence, mapping items into user-specific semantic IDs and enabling dynamic semantic encoding with multi-faceted interpretability. The method is seamlessly integrated into an autoregressive generative framework, preserving both discrete ID modeling and sequential generation capabilities. Contribution/Results: Evaluated on three public benchmarks, our approach achieves up to 11.44% improvement in NDCG@10 over non-personalized baselines. It is the first to incorporate personalized contextual information into the tokenization stage of generative recommendation, establishing a new paradigm for user-adaptive semantic representation learning.

Technology Category

Application Category

πŸ“ Abstract
Generative recommendation (GR) models tokenize each action into a few discrete tokens (called semantic IDs) and autoregressively generate the next tokens as predictions, showing advantages such as memory efficiency, scalability, and the potential to unify retrieval and ranking. Despite these benefits, existing tokenization methods are static and non-personalized. They typically derive semantic IDs solely from item features, assuming a universal item similarity that overlooks user-specific perspectives. However, under the autoregressive paradigm, semantic IDs with the same prefixes always receive similar probabilities, so a single fixed mapping implicitly enforces a universal item similarity standard across all users. In practice, the same item may be interpreted differently depending on user intentions and preferences. To address this issue, we propose a personalized context-aware tokenizer that incorporates a user's historical interactions when generating semantic IDs. This design allows the same item to be tokenized into different semantic IDs under different user contexts, enabling GR models to capture multiple interpretive standards and produce more personalized predictions. Experiments on three public datasets demonstrate up to 11.44% improvement in NDCG@10 over non-personalized action tokenization baselines. Our code is available at https://github.com/YoungZ365/Pctx.
Problem

Research questions and friction points this paper is trying to address.

Addresses static tokenization lacking personalization in generative recommendation
Enables dynamic semantic IDs based on user context and preferences
Improves recommendation personalization through contextual action tokenization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Personalized tokenizer incorporates user historical interactions
Dynamic semantic IDs adapt to different user contexts
Enables multiple item interpretation standards for personalization
πŸ”Ž Similar Papers