Mem-PAL: Towards Memory-based Personalized Dialogue Assistants for Long-term User-Agent Interaction

📅 2025-11-17

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Existing approaches struggle to model users’ subjective traits and evolving preferences in long-term human–machine dialogues, resulting in inadequate personalization of service responses. To address this, we propose PAL-Bench—the first Chinese multi-session personalized dialogue benchmark—and construct PAL-Set, a large-scale dataset comprising real-world user logs and dialogue histories. We introduce H²Memory, a hierarchical heterogeneous memory framework that integrates retrieval-augmented generation (RAG) to dynamically model user characteristics and preserve long-term memory. High-quality training data are generated via an LLM-driven multi-step synthetic pipeline and rigorously validated by human annotators. Experiments demonstrate that H²Memory significantly improves personalized response quality on PAL-Bench and multiple external benchmarks, effectively supporting user modeling and service adaptation in extended interactions.

Technology Category

Application Category

📝 Abstract

With the rise of smart personal devices, service-oriented human-agent interactions have become increasingly prevalent. This trend highlights the need for personalized dialogue assistants that can understand user-specific traits to accurately interpret requirements and tailor responses to individual preferences. However, existing approaches often overlook the complexities of long-term interactions and fail to capture users' subjective characteristics. To address these gaps, we present PAL-Bench, a new benchmark designed to evaluate the personalization capabilities of service-oriented assistants in long-term user-agent interactions. In the absence of available real-world data, we develop a multi-step LLM-based synthesis pipeline, which is further verified and refined by human annotators. This process yields PAL-Set, the first Chinese dataset comprising multi-session user logs and dialogue histories, which serves as the foundation for PAL-Bench. Furthermore, to improve personalized service-oriented interactions, we propose H$^2$Memory, a hierarchical and heterogeneous memory framework that incorporates retrieval-augmented generation to improve personalized response generation. Comprehensive experiments on both our PAL-Bench and an external dataset demonstrate the effectiveness of the proposed memory framework.

Problem

Research questions and friction points this paper is trying to address.

Personalized dialogue assistants lack long-term user trait understanding

Existing approaches fail to capture subjective characteristics in extended interactions

Need improved memory frameworks for personalized service-oriented dialogue systems

Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-step LLM synthesis pipeline for data generation

Hierarchical heterogeneous memory framework H2Memory

Retrieval-augmented generation for personalized responses

🔎 Similar Papers

Hello Again! LLM-powered Personalized Agent for Long-term Dialogue

2024-06-09arXiv.orgCitations: 9

Authors to Follow