Memory Intelligence Agent

📅 2026-04-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitations of existing deep reasoning agents, which suffer from inefficient memory evolution and high storage and retrieval costs, hindering effective reasoning and autonomous self-improvement. To overcome these challenges, the authors propose the Memory Intelligence Agent (MIA) framework, featuring a Manager-Planner-Executor architecture that innovatively integrates parametric and non-parametric memory with a bidirectional conversion mechanism to enable online memory evolution during reasoning. The framework incorporates alternating reinforcement learning, test-time continual learning, memory compression, search-based planning, and a reflection module coupled with unsupervised judgment to significantly enhance autonomous evolution in open-ended environments. Evaluated across 11 benchmark tasks, the proposed method demonstrates substantial improvements over state-of-the-art approaches in both reasoning efficiency and self-evolution capabilities.
📝 Abstract
Deep research agents (DRAs) integrate LLM reasoning with external tools. Memory systems enable DRAs to leverage historical experiences, which are essential for efficient reasoning and autonomous evolution. Existing methods rely on retrieving similar trajectories from memory to aid reasoning, while suffering from key limitations of ineffective memory evolution and increasing storage and retrieval costs. To address these problems, we propose a novel Memory Intelligence Agent (MIA) framework, consisting of a Manager-Planner-Executor architecture. Memory Manager is a non-parametric memory system that can store compressed historical search trajectories. Planner is a parametric memory agent that can produce search plans for questions. Executor is another agent that can search and analyze information guided by the search plan. To build the MIA framework, we first adopt an alternating reinforcement learning paradigm to enhance cooperation between the Planner and the Executor. Furthermore, we enable the Planner to continuously evolve during test-time learning, with updates performed on-the-fly alongside inference without interrupting the reasoning process. Additionally, we establish a bidirectional conversion loop between parametric and non-parametric memories to achieve efficient memory evolution. Finally, we incorporate a reflection and an unsupervised judgment mechanisms to boost reasoning and self-evolution in the open world. Extensive experiments across eleven benchmarks demonstrate the superiority of MIA.
Problem

Research questions and friction points this paper is trying to address.

memory evolution
storage cost
retrieval cost
deep research agents
reasoning efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Memory Intelligence Agent
test-time learning
parametric memory
non-parametric memory
alternating reinforcement learning
🔎 Similar Papers
No similar papers found.
J
Jingyang Qiao
East China Normal University, Shanghai Innovation Institute, Harbin Institute of Technology, Xiamen University, Shanghai Artificial Intelligence Laboratory, Independent Researcher
W
Weicheng Meng
East China Normal University, Shanghai Innovation Institute, Harbin Institute of Technology, Xiamen University, Shanghai Artificial Intelligence Laboratory, Independent Researcher
Y
Yu Cheng
East China Normal University, Shanghai Innovation Institute, Harbin Institute of Technology, Xiamen University, Shanghai Artificial Intelligence Laboratory, Independent Researcher
Zhihang Lin
Zhihang Lin
Xiamen University & Shanghai Innovation Institute
Efficient Artificial Intelligence
Zhizhong Zhang
Zhizhong Zhang
Associate Researcher, East China Normal University
Computer Vision
Xin Tan
Xin Tan
Research Professor, East China Normal University & Shanghai AI Laboratory
3D VisionTrustworthy Embodied AI
Jingyu Gong
Jingyu Gong
Shanghai Jiao Tong University
3D Computer Vision
Kun Shao
Kun Shao
Huawei
AI Agentreinforcement learningmulti-agent systemsembodied AIgame AI
Yuan Xie
Yuan Xie
Full Professor, School of Computer Science and Technology, East China Normal University
computer vision and image processing