Centaur: a foundation model of human cognition

📅 2024-10-26
🏛️ arXiv.org
📈 Citations: 10
Influential: 1
🤖 AI Summary
This study aims to develop a foundation model of cognition that predicts human behavior across psychological experiments, as a first step toward a computationally formalized unified theory of cognition. Method: The authors fine-tune a state-of-the-art language model on Psych-101, a large-scale natural-language dataset of psychological experiments (trial-by-trial data from over 60,000 participants making over 10 million choices in 160 experiments). Because every experiment is expressed in natural language, a single model can cover all of the tasks. Contribution/Results: Centaur predicts the behavior of held-out participants better than existing cognitive models and generalizes to new cover stories, structural task modifications, and entirely new domains. After fine-tuning, its internal representations are also more closely aligned with human neural activity. The authors argue that such models can guide the development of cognitive theories and present a case study to illustrate this.

📝 Abstract
Establishing a unified theory of cognition has been a major goal of psychology. While there have been previous attempts to instantiate such theories by building computational models, we currently do not have one model that captures the human mind in its entirety. A first step in this direction is to create a model that can predict human behavior in a wide range of settings. Here we introduce Centaur, a computational model that can predict and simulate human behavior in any experiment expressible in natural language. We derived Centaur by finetuning a state-of-the-art language model on a novel, large-scale data set called Psych-101. Psych-101 reaches an unprecedented scale, covering trial-by-trial data from over 60,000 participants performing over 10,000,000 choices in 160 experiments. Centaur not only captures the behavior of held-out participants better than existing cognitive models, but also generalizes to new cover stories, structural task modifications, and entirely new domains. Furthermore, we find that the model's internal representations become more aligned with human neural activity after finetuning. Taken together, our results demonstrate that it is possible to discover computational models that capture human behavior across a wide range of domains. We believe that such models provide tremendous potential for guiding the development of cognitive theories and present a case study to demonstrate this.
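
The abstract's phrase "any experiment expressible in natural language" can be made concrete with a small sketch of how one participant's trial-by-trial data might be rendered as text that a language model can be fine-tuned on. Everything here is an illustrative assumption rather than the authors' pipeline: the `Trial` record, the `render_transcript` helper, the slot-machine wording, and the `<<`/`>>` markers around each human choice are hypothetical names and conventions chosen for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Trial:
    choice: str    # label of the option the participant picked, e.g. "A"
    reward: float  # points received on this trial


def render_transcript(instructions: str, trials: List[Trial]) -> str:
    """Render one participant's session as plain text, one line per trial."""
    lines = [instructions.strip(), ""]
    for t in trials:
        # Assumption of this sketch: wrap each human choice in << >> so the
        # positions of the choices can be located in the text later.
        lines.append(f"You press <<{t.choice}>> and get {t.reward:g} points.")
    return "\n".join(lines)


# Hypothetical two-option task with three recorded trials.
session = [Trial("A", 4), Trial("B", 7), Trial("B", 6.5)]
transcript = render_transcript(
    "You can choose between two slot machines, A and B.", session
)
print(transcript)
```

In a setup like this, a model's prediction for a held-out participant could be scored by the probability it assigns to the text inside each marked choice, given everything that precedes it in the transcript.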
Problem

Research questions and friction points this paper is trying to address.

Creating a unified computational model of human cognition
Predicting human behavior across diverse experimental settings
Aligning model representations with human neural activity patterns
Innovation

Methods, ideas, or system contributions that make the work stand out.

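Fine-tuning a state-of-the-art language model on human behavioral data
Training on Psych-101, a large-scale dataset of 160 experiments expressed in natural language
Predicting and simulating human behavior in any experiment expressible in natural language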

Authors

Marcel Binz
Helmholtz Munich
cognitive science, machine learning, large language models, automated science, in-context learning

Elif Akata
Helmholtz Munich, University of Tübingen
machine learning, cognitive science

Matthias Bethge
Tübingen University & Maddox Co-Founder
Computational Neuroscience, Machine Learning, Vision

Franziska Brändle
University of Oxford, Max Planck Institute for Biological Cybernetics

Fred Callaway
New York University

Julian Coda-Forno
ELLIS, Helmholtz/TUM
LLMs, Cognitive Science, Meta-learning, Deep Learning, Reinforcement Learning

Peter Dayan
MPI for Biological Cybernetics
Theoretical Neuroscience

Can Demircan
Helmholtz Munich
machine learning, cognitive science

Maria K. Eckstein
Google DeepMind

Noémi Éltető
Max Planck Institute for Biological Cybernetics

Thomas L. Griffiths
Professor of Psychology and Computer Science, Princeton University
Computational Models of Cognition, Cognitive Science, Machine Learning, Cognitive Psychology, Bayesian Statistics

Susanne Haridi
Helmholtz Munich, Max Planck School of Cognition

Akshay Jagadish
Helmholtz Munich, University of Tuebingen, Max Planck Institute for Biological Cybernetics

Ji-An Li
University of California San Diego

Alexander Kipnis
Helmholtz Munich

Sreejan Kumar
PhD Candidate, Princeton University
Cognitive Science, Machine Learning, Computational Neuroscience

Tobias Ludwig
University of Tuebingen, Max Planck Institute for Biological Cybernetics

Marvin Mathony
Helmholtz Munich

Marcelo Mattar
New York University

Alireza Modirshanechi
Helmholtz Munich

Surabhi S. Nath
Doctoral Student, Max Planck School of Cognition
Decision Making, Aesthetics, Creativity, Reward Learning

Joshua C. Peterson
Boston University

Milena Rmuš
Helmholtz Munich

Evan M. Russek
Princeton University

Tankred Saanum
Harvard University
Cognitive science, Deep reinforcement learning

Natalia Scharfenberg
Max Planck Institute for Biological Cybernetics

Johannes A. Schubert
Max Planck Institute for Biological Cybernetics

Luca M. Schulze Buschoff
Helmholtz Munich
Machine Learning, Cognitive Science

Nishad Singhi
TU Darmstadt

Xin Sui
University of Tuebingen, Max Planck Institute for Biological Cybernetics

Mirko Thalmann
Institute for Human-Centered AI
cognitive processes, mental representations, memory

Fabian Theis
Helmholtz Munich

Vuong Truong
Max Planck Institute for Biological Cybernetics

Vishaal Udandarao
PhD Student, University of Tübingen & University of Cambridge
Data-centric ML, Foundation Models, Vision and Language, Computer Vision

Konstantinos Voudouris
Postdoctoral Research Scientist, Helmholtz Munich
AI Evaluation, Cognitive Science, Philosophy of Science, Linguistics

Robert Wilson
Georgia Institute of Technology
computational and cognitive neuroscience

Kristin Witte
Helmholtz Munich
computational psychiatry, neuroscience

Shuchen Wu
Allen Institute & University of Washington
Chunking, Abstraction Learning, Representation Learning, Cognitive Science, Machine Learning

Dirk Wulff
University of Basel, Max Planck Institute for Human Development

Huadong Xiong
Georgia Institute of Technology

Eric Schulz
Helmholtz Munich
Cognitive Science, Machine Learning, Computational Neuroscience, Large Language Models