🤖 AI Summary
Existing role-playing agents rely on recurrent summarization-based memory mechanisms, which often lose critical details and struggle to maintain long-term consistency. This work proposes BOOKMARKS—a retrieval-augmented memory framework that explicitly stores task-relevant key information (e.g., concepts, actions, and states) in structured question-answer pairs termed “bookmarks.” By integrating task-driven strategies for active initialization, selective retrieval, and synchronized updates, BOOKMARKS enables efficient and precise memory reuse. Experiments across 85 characters from 16 literary works demonstrate that the proposed approach significantly outperforms existing memory baselines, confirming the effectiveness of a search-oriented memory mechanism in enhancing the consistency of role-playing agents.
📝 Abstract
Memory systems are critical for role-playing agents (RPAs) to maintain long-horizon consistency. However, existing RPA memory methods (e.g., profiling) mainly rely on recurrent summarization, whose compression inevitably discards important details. To address this issue, we propose a search-based memory framework called BOOKMARKS, which actively initializes, maintains, and updates task-relevant pieces of bookmarks for the current task (e.g., character acting). A bookmark is structured as the answer to a question at a specific point in the storyline. For each current task, BOOKMARKS selects reusable existing bookmarks or initializes new ones (at storyline beginning) with useful questions. These bookmarks are then synchronized to the current story point, with their answers updated accordingly, so they can be efficiently reused in future grounding rounds. Compared with recurrent summarization, BOOKMARKS offers (1) active grounding for capturing task-specific details and (2) passive updating to avoid unnecessary computation. In implementation, BOOKMARKS supports concept, behavior, and state searches, each powered by an efficient synchronization method. BOOKMARKS significantly outperforms RPA memory baselines on 85 characters from 16 artifacts, demonstrating the effectiveness of search-based memory for RPAs.