Text2Mem: A Unified Memory Operation Language for Memory Operating System

📅 2025-09-14

📈 Citations: 0

✨ Influential: 0

career value

199K/year

🤖 AI Summary

Existing LLM agent memory frameworks support only basic operations (e.g., encode, retrieve, delete) and lack advanced capabilities such as merging, promoting, or demoting memories; moreover, memory commands lack formal specification, leading to unpredictable behavior and poor cross-system interoperability. Method: We propose Text2Mem—the first standardized memory operation language—featuring a JSON Schema–defined instruction set and semantic invariants that establish an end-to-end path from natural language directives to deterministic execution. Its three-layer architecture (parsing–validation–adaptation) decouples instruction generation from execution, supporting the full spectrum of operations (encode, retrieve, merge, promote, etc.). A unified execution contract integrates embedding and summarization models, backed by an extensible SQL prototype. Contribution/Results: Text2Mem ensures safety, determinism, and portability across heterogeneous backends. We further introduce Text2Mem Bench, a benchmark suite enabling systematic evaluation of memory operation frameworks.

Technology Category

Application Category

📝 Abstract

Large language model agents increasingly depend on memory to sustain long horizon interaction, but existing frameworks remain limited. Most expose only a few basic primitives such as encode, retrieve, and delete, while higher order operations like merge, promote, demote, split, lock, and expire are missing or inconsistently supported. Moreover, there is no formal and executable specification for memory commands, leaving scope and lifecycle rules implicit and causing unpredictable behavior across systems. We introduce Text2Mem, a unified memory operation language that provides a standardized pathway from natural language to reliable execution. Text2Mem defines a compact yet expressive operation set aligned with encoding, storage, and retrieval. Each instruction is represented as a JSON based schema instance with required fields and semantic invariants, which a parser transforms into typed operation objects with normalized parameters. A validator ensures correctness before execution, while adapters map typed objects either to a SQL prototype backend or to real memory frameworks. Model based services such as embeddings or summarization are integrated when required. All results are returned through a unified execution contract. This design ensures safety, determinism, and portability across heterogeneous backends. We also outline Text2Mem Bench, a planned benchmark that separates schema generation from backend execution to enable systematic evaluation. Together, these components establish the first standardized foundation for memory control in agents.

Problem

Research questions and friction points this paper is trying to address.

Standardizing memory operations for LLM agents

Providing formal specification for memory commands

Ensuring cross-system safety and portability

Innovation

Methods, ideas, or system contributions that make the work stand out.

JSON-based schema for memory operations

Validator ensures execution correctness

Adapters map to SQL and frameworks

🔎 Similar Papers

Virtuoso: Enabling Fast and Accurate Virtual Memory Research via an Imitation-based Operating System Simulation Methodology