🤖 AI Summary
Existing external memory management for large language models relies on hand-crafted heuristics, which struggle with delayed long-term utility and interaction uncertainty. This paper formalizes the problem as a sequential decision-making task under uncertainty and proposes DAM—a decoupled memory operation paradigm that separates immediate access from hierarchical storage maintenance. DAM employs a decision framework grounded in value-function estimation and uncertainty quantification to explicitly balance long-term utility against risk, ensuring both interpretability and scalability. By integrating decision-theoretic modeling, rigorous uncertainty quantification, and policy aggregation, the work establishes a normative foundation for memory management, exposes fundamental limitations of heuristic approaches, and provides a principled design paradigm for uncertainty-aware memory systems.
📝 Abstract
External memory is a key component of modern large language model (LLM) systems, enabling long-term interaction and personalization. Despite its importance, memory management is still largely driven by hand-designed heuristics, offering little insight into the long-term and uncertain consequences of memory decisions. In practice, choices about what to read or write shape future retrieval and downstream behavior in ways that are difficult to anticipate. We argue that memory management should be viewed as a sequential decision-making problem under uncertainty, where the utility of memory is delayed and dependent on future interactions. To this end, we propose DAM (Decision-theoretic Agent Memory), a decision-theoretic framework that decomposes memory management into immediate information access and hierarchical storage maintenance. Within this architecture, candidate operations are evaluated via value functions and uncertainty estimators, enabling an aggregate policy to arbitrate decisions based on estimated long-term utility and risk. Our contribution is not a new algorithm, but a principled reframing that clarifies the limitations of heuristic approaches and provides a foundation for future research on uncertainty-aware memory systems.