🤖 AI Summary
Existing LLM alignment methods focus on static, single-turn, or universal value alignment, failing to model users’ long-term personalized preferences or to address cold-start challenges. This paper proposes PersonalAgent—the first proactive dialogue agent to formalize cross-session personalization as a sequential reasoning task. It integrates LLM-driven dialogue decomposition, temporal preference modeling, dynamic user-profile updating, and reinforcement learning–based policy optimization to track preference evolution and mitigate cold start. Its key innovation is a cross-session, consistency-preserving sequential alignment framework that is inherently robust to dialogue noise. Experiments demonstrate significant improvements over prompt-based and policy-optimization baselines under both ideal and noisy dialogue conditions, and human evaluation confirms that PersonalAgent achieves state-of-the-art naturalness and consistency in preference understanding.
📝 Abstract
The deployment of Large Language Models (LLMs) in interactive systems necessitates deep alignment with the nuanced and dynamic preferences of individual users. Current alignment techniques predominantly target universal human values or static, single-turn preferences, thereby failing to address the critical needs of long-term personalization and the initial user cold-start problem. To bridge this gap, we propose PersonalAgent, a novel user-centric lifelong agent designed to continuously infer and adapt to user preferences. PersonalAgent constructs and dynamically refines a unified user profile by decomposing dialogues into single-turn interactions, framing preference inference as a sequential decision-making task. Experiments show that PersonalAgent outperforms strong prompt-based and policy-optimization baselines, not only in idealized but also in noisy conversational contexts, while preserving cross-session preference consistency. Furthermore, human evaluation confirms that PersonalAgent excels at capturing user preferences naturally and coherently. Our findings underscore the importance of lifelong personalization for developing more inclusive and adaptive conversational agents. Our code is available here.
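The sequential profile-update loop the abstract describes (decompose a dialogue into single-turn signals, then fold each signal into a unified user profile) can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the class and function names (`UserProfile`, `process_session`) are hypothetical, and the confidence-blending rule stands in for the paper's LLM-driven inference and RL-optimized policy.

```python
from dataclasses import dataclass, field

@dataclass
class UserProfile:
    # preference key -> (value, confidence); structure is illustrative only
    preferences: dict = field(default_factory=dict)

    def update(self, key: str, value: str, weight: float = 0.3) -> None:
        """Blend one single-turn preference signal into the profile.

        Repeated evidence for the same value raises confidence; conflicting
        evidence lowers it, and sustained conflict flips the stored value,
        modeling preference drift across sessions.
        """
        old_value, old_conf = self.preferences.get(key, (value, 0.0))
        if old_value == value:
            self.preferences[key] = (value, min(1.0, old_conf + weight))
        else:
            conf = old_conf - weight
            if conf < 0:
                # accumulated contrary evidence outweighs the old preference
                self.preferences[key] = (value, -conf)
            else:
                self.preferences[key] = (old_value, conf)

def process_session(profile: UserProfile, turns) -> UserProfile:
    """Apply a session's turn-level signals sequentially.

    `turns` is a list of (key, value) pairs standing in for the
    LLM-based dialogue decomposition and preference extraction step.
    """
    for key, value in turns:
        profile.update(key, value)
    return profile

profile = UserProfile()
process_session(profile, [("cuisine", "thai"), ("cuisine", "thai"), ("budget", "low")])
```

A cold-start user simply begins with an empty profile that gains confidence as evidence accumulates, and later sessions reuse the same profile object, which is what gives the loop its cross-session consistency.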