MEMENTO: Leveraging Web as a Learning Signal for Low-Data Domains

📅 2026-05-28

📈 Citations: 0

✨ Influential: 0

career value

186K/year

🤖 AI Summary

This work addresses the challenge of limited labeled data in specialized domains, where models struggle to acquire task-specific expertise. The authors propose a training-free continual learning framework that uniquely leverages the interaction process itself as a learning signal. By introducing an Adaptive Exploration Tree (AET) for intra-conversation policy exploration and a dual-channel memory system that separately stores declarative (factual) and procedural (operational) knowledge, the method enables cross-session knowledge reuse and performance improvement. Evaluated on two low-data tasks—sales automation and legal research—the approach outperforms the ReAct baseline by 25.6% and 36.5%, respectively, demonstrating its effectiveness and generalization capability.

📝 Abstract

Real-world tasks often lack large labeled datasets, motivating extensive work on learning in low-data regimes. However, existing approaches such as few-shot prompting, instruction tuning, and synthetic data generation, continue to treat labeled or pseudo-labeled data as the primary learning signal. In contrast, human practitioners acquire expertise through repeated, self-directed interaction with the open web, progressively refining both domain knowledge and search strategies. We propose MEMENTO, a framework that treats the web as a learning signal rather than a stateless retrieval interface. MEMENTO operates at two levels: within each session, it conducts iterative web exploration via an Adaptive Exploration Tree (AET) that decomposes tasks into evolving questions and reflects on intermediate findings; across sessions, it accumulates experience through dual-channel memory, separating declarative knowledge (facts) from procedural knowledge (search strategies). This design enables agents to learn reusable research strategies and domain expertise from trajectories of web interaction without additional model training. We evaluate MEMENTO on two low-data professional domains: sales automation and legal research. Our empirical results show consistent improvements in performance over ReAct based baselines (+25.6% on sales automation and 36.5% on legal research), demonstrating that the web can serve as a scalable learning source for acquiring task-specific expertise in data-scarce settings.

Problem

Research questions and friction points this paper is trying to address.

low-data domains

web as learning signal

task-specific expertise

data scarcity

professional domains

Innovation

Methods, ideas, or system contributions that make the work stand out.

web-as-learning-signal

adaptive exploration tree

dual-channel memory