MirrorMind: Empowering OmniScientist with the Expert Perspectives and Collective Knowledge of Human Scientists

📅 2025-11-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Contemporary AI scientists model scientific discovery as isolated optimization tasks, neglecting its inherently social and historical nature—specifically, the interplay between individual cognitive trajectories (shaped by research history and stylistic conventions) and collective disciplinary memory (encoded in citation networks and conceptual structures)—which current LLMs fail to model with high fidelity. Method: We propose a hierarchical cognitive architecture that decouples memory storage from agent execution, integrating author-level individual memory, domain-specific concept graphs, and multi-agent collaboration mechanisms to enable dynamic, synergistic access to both individual perspectives and collective knowledge. The framework unifies situational, semantic, and role-based memory modalities. Contribution/Results: Evaluated on four tasks—author-level cognitive simulation, complementary reasoning, interdisciplinary collaboration, and deep insight generation—the architecture significantly enhances deep reasoning and innovative discovery capabilities, establishing a novel paradigm for AI-supported authentic scientific collaboration.

Technology Category

Application Category

📝 Abstract
The emergence of AI Scientists has demonstrated remarkable potential in automating scientific research. However, current approaches largely conceptualize scientific discovery as a solitary optimization or search process, overlooking that knowledge production is inherently a social and historical endeavor. Human scientific insight stems from two distinct yet interconnected sources. First is the individual cognitive trajectory, where a researcher's unique insight is shaped by their evolving research history and stylistic preferences; another is the collective disciplinary memory, where knowledge is sedimented into vast, interconnected networks of citations and concepts. Existing LLMs still struggle to represent these structured, high-fidelity cognitive and social contexts. To bridge this gap, we introduce MirrorMind, a hierarchical cognitive architecture that integrates dual-memory representations within a three-level framework. The Individual Level constructs high-fidelity cognitive models of individual researchers by capturing their episodic, semantic, and persona memories; the Domain Level maps collective knowledge into structured disciplinary concept graphs; and the Interdisciplinary Level that acts as an orthogonal orchestration engine. Crucially, our architecture separates memory storage from agentic execution, enabling AI scientist agents to flexibly access individual memories for unique perspectives or collective structures to reason. We evaluate MirrorMind across four comprehensive tasks, including author-level cognitive simulation, complementary reasoning, cross-disciplinary collaboration promotion, and multi-agent scientific problem solving. The results show that by integrating individual cognitive depth with collective disciplinary breadth, MirrorMind moves beyond simple fact retrieval toward structural, personalized, and insight-generating scientific reasoning.
Problem

Research questions and friction points this paper is trying to address.

AI scientists overlook social historical nature of scientific discovery
Existing models struggle with structured cognitive social contexts
Current approaches treat science as solitary optimization process
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical cognitive architecture with dual-memory representations
Individual researcher modeling through episodic and semantic memories
Structured disciplinary concept graphs for collective knowledge mapping
Q
Qingbin Zeng
Department of Electronic Engineering, BNRist, Tsinghua University
B
Bingbing Fan
Department of Electronic Engineering, BNRist, Tsinghua University
Zhiyu Chen
Zhiyu Chen
Amazon
Conversational AILarge Language ModelsInformation RetrievalNatural language Processing
S
Sijian Ren
Department of Electronic Engineering, BNRist, Tsinghua University
Zhilun Zhou
Zhilun Zhou
Tsinghua University
urban computing
X
Xuhua Zhang
Zhongguancun Academy
Y
Yuanyi Zhen
Zhongguancun Academy
Fengli Xu
Fengli Xu
Tsinghua University
LLM AgentData ScienceSocial ComputingScience of ScienceUrban Science
Y
Yong Li
Department of Electronic Engineering, BNRist, Tsinghua University, Zhongguancun Academy
Tie-Yan Liu
Tie-Yan Liu
President, Zhongguancun Academy | IEEE Fellow | ACM Fellow | AAIA Fellow
Machine learningAI for ScienceAI for IndustryInformation retrievalNLP