Towards Proactive Personalization through Profile Customization for Individual Users in Dialogues

📅 2025-12-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing LLM alignment methods focus on static, single-turn, or universal value alignment, failing to model users’ long-term personalized preferences and address cold-start challenges. This paper proposes PersonalAgent—the first proactive dialogue agent that formalizes cross-session personalization as a sequential reasoning task. It integrates LLM-driven dialogue decomposition, temporal preference modeling, dynamic user profile updating, and reinforcement learning–based policy optimization to track preference evolution and mitigate cold start. Its key innovation lies in establishing the first cross-session consistent sequential alignment framework, inherently robust to dialogue noise. Experiments demonstrate significant improvements over prompt-based and policy-optimization baselines under both ideal and noisy dialogue conditions. Human evaluation confirms that PersonalAgent achieves state-of-the-art performance in naturalness and consistency of preference understanding.

Technology Category

Application Category

📝 Abstract
The deployment of Large Language Models (LLMs) in interactive systems necessitates a deep alignment with the nuanced and dynamic preferences of individual users. Current alignment techniques predominantly address universal human values or static, single-turn preferences, thereby failing to address the critical needs of long-term personalization and the initial user cold-start problem. To bridge this gap, we propose PersonalAgent, a novel user-centric lifelong agent designed to continuously infer and adapt to user preferences. PersonalAgent constructs and dynamically refines a unified user profile by decomposing dialogues into single-turn interactions, framing preference inference as a sequential decision-making task. Experiments show that PersonalAgent achieves superior performance over strong prompt-based and policy optimization baselines, not only in idealized but also in noisy conversational contexts, while preserving cross-session preference consistency. Furthermore, human evaluation confirms that PersonalAgent excels at capturing user preferences naturally and coherently. Our findings underscore the importance of lifelong personalization for developing more inclusive and adaptive conversational agents. Our code is available here.
Problem

Research questions and friction points this paper is trying to address.

Enhances user personalization in LLM dialogues
Addresses long-term preference adaptation and cold-start
Improves cross-session consistency in noisy contexts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic user profile construction via dialogue decomposition
Sequential decision-making for continuous preference inference
Lifelong agent adaptation to preserve cross-session consistency
🔎 Similar Papers
No similar papers found.
X
Xiaotian Zhang
Zhejiang University, Zhejiang Key Laboratory of Medical Imaging Artificial Intelligence
Y
Yuan Wang
Zhejiang University, Zhejiang Key Laboratory of Medical Imaging Artificial Intelligence
Ruizhe Chen
Ruizhe Chen
Zhejiang University
LLMMLLM
Z
Zeya Wang
Zhejiang University
R
Runchen Hou
Zhejiang University
Zuozhu Liu
Zuozhu Liu
Assistant Professor, Zhejiang University/University of Illinois Urbana-Champaign
deep learningvision-language modelsmedical AI