Intrinsically-Motivated Humans and Agents in Open-World Exploration

📅 2025-03-31

📈 Citations: 0

✨ Influential: 0

career value

245K/year

🤖 AI Summary

This study investigates how intrinsic motivation drives divergent exploration behaviors between humans (adults and children) and AI agents in the open-world environment Crafter. Method: We conducted behavioral experiments, collected private speech annotations, and computationally modeled three intrinsic motivation signals—entropy, information gain, and empowerment—to systematically quantify and compare cross-subject exploration dynamics. Contribution/Results: We find, for the first time, that entropy and empowerment consistently correlate positively with human exploration progress: entropy primarily governs early-stage state diversity, while empowerment underpins later-stage control acquisition; information gain shows weaker associations. Notably, children’s goal-directed verbalizations significantly enhance exploration efficiency. Critically, the entropy–empowerment synergy better captures human exploration logic than either metric alone, providing empirical grounding and a computationally tractable pathway for designing human-like exploration strategies in embodied AI agents.

Technology Category

Application Category

📝 Abstract

What drives exploration? Understanding intrinsic motivation is a long-standing challenge in both cognitive science and artificial intelligence; numerous objectives have been proposed and used to train agents, yet there remains a gap between human and agent exploration. We directly compare adults, children, and AI agents in a complex open-ended environment, Crafter, and study how common intrinsic objectives: Entropy, Information Gain, and Empowerment, relate to their behavior. We find that only Entropy and Empowerment are consistently positively correlated with human exploration progress, indicating that these objectives may better inform intrinsic reward design for agents. Furthermore, across agents and humans we observe that Entropy initially increases rapidly, then plateaus, while Empowerment increases continuously, suggesting that state diversity may provide more signal in early exploration, while advanced exploration should prioritize control. Finally, we find preliminary evidence that private speech utterances, and particularly goal verbalizations, may aid exploration in children.

Problem

Research questions and friction points this paper is trying to address.

Compare human and AI exploration in open-world environments

Identify intrinsic motivations (Entropy, Empowerment) linked to human exploration

Study role of verbalizations in children's exploration strategies

Innovation

Methods, ideas, or system contributions that make the work stand out.

Compare humans and AI in open-world exploration

Use Entropy and Empowerment as intrinsic rewards

Study verbalizations aiding child exploration

🔎 Similar Papers

RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning

2024-05-29arXiv.orgCitations: 0

Bosch Group

Renningen, BW, DE

AI Research Scientist - FAIR Social Intelligence