SpikeProphecy: A Large-Scale Benchmark for Autoregressive Neural Population Forecasting

📅 2026-05-13
📈 Citations: 0
Influential: 0
📄 PDF

career value

223K/year
🤖 AI Summary
Existing models for predicting neural population activity rely solely on Pearson correlation, neglecting critical structural information such as temporal dynamics, spatial patterns, and amplitude relationships. To address this limitation, this work establishes the first large-scale autoregressive prediction benchmark using 105 Neuropixels sessions encompassing approximately 89,800 neurons, evaluating seven architectural families—including state-space models (SSMs), Transformers, LSTMs, and spiking neural networks (SNNs). The study introduces a novel evaluation framework that decomposes predictive performance into temporal fidelity, spatial pattern accuracy, and amplitude-invariant alignment. This approach reveals a hierarchical organization of predictability across brain regions (ΔR² = 0.018), identifies a sub-Poisson lower bound on prediction error, and demonstrates the limitations of both artificial-to-spiking neural network distillation and conventional metrics in capturing biophysically constrained performance limits.
📝 Abstract
Neural population models, which predict the joint firing of many simultaneously recorded neurons forward in time, are typically evaluated by a single aggregate Pearson correlation $r$ between predicted and actual spike counts, a number that masks critical structure. We argue that how we evaluate spike forecasting matters as much as what we build, and introduce SpikeProphecy, the first large-scale benchmark for causal, autoregressive spike-count forecasting on real electrophysiology recordings. Our core contribution is a population metric decomposition that separates aggregate performance into temporal fidelity, spatial pattern accuracy, and magnitude-invariant alignment. The decomposition surfaces aspects of the underlying data that an aggregate scalar collapses together. We apply the protocol to 105 Neuropixels sessions (Steinmetz 2019 + IBL Repeated Site; ~89,800 neurons) with seven architecture baselines spanning four structural families: four SSMs (three diagonal and one non-diagonal), a Transformer, an LSTM, and a spiking network. The decomposition surfaces a brain-region predictability ranking that reproduces across all seven baselines and survives ANCOVA correction for firing-statistics constraints (region $ΔR^2 = 0.018$ above the firing-statistics covariates). It also exposes a sub-Poisson evaluation floor where rigorous metrics combine with genuine biophysical constraints on regular spike trains, and yields a negative result on KL-on-output-rates distillation for ANN-to-SNN transfer in this Poisson count domain.
Problem

Research questions and friction points this paper is trying to address.

neural population forecasting
spike prediction
evaluation metrics
autoregressive modeling
electrophysiology
Innovation

Methods, ideas, or system contributions that make the work stand out.

SpikeProphecy
autoregressive forecasting
metric decomposition
neural population modeling
sub-Poisson evaluation
J
John R. Minnick
Department of Electrical and Computer Engineering, University of California, Santa Cruz, CA, USA; UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
J
Jinghui Geng
UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA; Department of Computer Science and Engineering, University of California, Santa Cruz, CA, USA
K
Kamran Hussain
Department of Applied Mathematics, University of California, Santa Cruz, CA, USA
J
Jesus Gonzalez-Ferrer
UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA; Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
A
Ash Robbins
Department of Electrical and Computer Engineering, University of California, Santa Cruz, CA, USA; UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
M
Mohammed A. Mostajo-Radji
UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
David Haussler
David Haussler
Scientific Director, UC Santa Cruz Genomics Institute, University of California, Santa Cruz
genomicscomputer sciencemolecular biologyevolutioncancer
Jason K. Eshraghian
Jason K. Eshraghian
University of California, Santa Cruz, Assistant Professor
lightweight machine learningneuromorphic computingspiking neural networks
Mircea Teodorescu
Mircea Teodorescu
Associate Professor, University of California Santa Cruz