OmniMouse: Scaling properties of multi-modal, multi-task Brain Models on 150B Neural Tokens

📅 2026-04-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

230K/year
🤖 AI Summary
This study investigates whether scaling laws established in artificial intelligence—governing the relationship between data, model size, and performance—extend to modeling brain activity, particularly in multimodal, multitask neural prediction settings. Leveraging a large-scale dataset comprising 150 billion neural tokens from the mouse visual cortex, we introduce OmniMouse, a unified model capable of flexibly combining diverse tasks such as neural response prediction and behavioral decoding. Systematic analysis reveals, for the first time, that brain model performance scales robustly with data volume but saturates with increasing parameter count, challenging the dominant AI paradigm that prioritizes parameter scaling. This finding suggests the potential existence of phase transitions or emergent capabilities in neural modeling. OmniMouse consistently outperforms task-specific baselines across nearly all benchmarks, underscoring the pivotal role of data-driven approaches in computational neuroscience.

Technology Category

Application Category

📝 Abstract
Scaling data and artificial neural networks has transformed AI, driving breakthroughs in language and vision. Whether similar principles apply to modeling brain activity remains unclear. Here we leveraged a dataset of 3.1 million neurons from the visual cortex of 73 mice across 323 sessions, totaling more than 150 billion neural tokens recorded during natural movies, images and parametric stimuli, and behavior. We train multi-modal, multi-task models that support three regimes flexibly at test time: neural prediction, behavioral decoding, neural forecasting, or any combination of the three. OmniMouse achieves state-of-the-art performance, outperforming specialized baselines across nearly all evaluation regimes. We find that performance scales reliably with more data, but gains from increasing model size saturate. This inverts the standard AI scaling story: in language and computer vision, massive datasets make parameter scaling the primary driver of progress, whereas in brain modeling -- even in the mouse visual cortex, a relatively simple system -- models remain data-limited despite vast recordings. The observation of systematic scaling raises the possibility of phase transitions in neural modeling, where larger and richer datasets might unlock qualitatively new capabilities, paralleling the emergent properties seen in large language models. Code available at https://github.com/enigma-brain/omnimouse.
Problem

Research questions and friction points this paper is trying to address.

brain modeling
scaling laws
neural data
multi-modal learning
model generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

neural scaling
multi-modal brain modeling
data-limited regime
emergent capabilities
OmniMouse
💼 Related Jobs
Konstantin F. Willeke
Konstantin F. Willeke
Stanford University
Artificial IntelligenceDeep LearningComputational Neuroscience
P
Polina Turishcheva
Institute of Computer Science and Campus Institute Data Science, University Göttingen
A
Alex Gilbert
Department of Ophthalmology, Byers Eye Institute, Stanford University
G
Goirik Chakrabarty
Institute of Computer Science and Campus Institute Data Science, University Göttingen
H
Hasan A. Bedel
Department of Ophthalmology, Byers Eye Institute, Stanford University
P
Paul G. Fahey
Department of Ophthalmology, Byers Eye Institute, Stanford University
Y
Yongrong Qiu
Department of Ophthalmology, Byers Eye Institute, Stanford University
Marissa A. Weis
Marissa A. Weis
Google
neurosciencedeep learningmachine learning
Michaela Vystrčilová
Michaela Vystrčilová
University of Göttingen
T
Taliah Muhammad
Department of Ophthalmology, Byers Eye Institute, Stanford University
L
Lydia Ntanavara
Department of Ophthalmology, Byers Eye Institute, Stanford University
R
Rachel E. Froebe
Department of Ophthalmology, Byers Eye Institute, Stanford University
K
Kayla Ponder
Department of Ophthalmology, Byers Eye Institute, Stanford University
Z
Zheng Huan Tan
Department of Ophthalmology, Byers Eye Institute, Stanford University
Emin Orhan
Emin Orhan
Unknown affiliation
machine learningartificial intelligencecognitive sciencecomputational neuroscience
Erick Cobos
Erick Cobos
Max Planck Institute for Intelligent Systems
Machine LearningRepresentation Learning
Sophia Sanborn
Sophia Sanborn
Stanford University
deep learningtheoretical neurosciencemechanistic interpretability
Katrin Franke
Katrin Franke
NTNU - Norwegian University of Science and Technology
Machine LearningDigital ForensicsComputational IntelligenceComputational Forensics
F
Fabian H. Sinz
Institute of Computer Science and Campus Institute Data Science, University Göttingen
Alexander S. Ecker
Alexander S. Ecker
University of Göttingen, Germany
Computational NeuroscienceVisionMachine LearningComputer VisionData Science
A
Andreas S. Tolias
Department of Ophthalmology, Byers Eye Institute, Stanford University