Voxel-Level Brain States Prediction Using Swin Transformer

📅 2025-06-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of predicting voxel-wise BOLD signal dynamics in human resting-state fMRI 7.2 seconds into the future, using only 23.04 seconds of raw time-series data. We propose the first whole-brain, voxel-level 4D Shifted Window Transformer (4D Swin Transformer), which directly models spatiotemporal dependencies across all four dimensions (x, y, z, t) without spatial downsampling, regional parcellation, or functional aggregation. The architecture comprises a 4D Swin encoder and a lightweight convolutional decoder, trained and validated on resting-state fMRI data (REST1) from 100 unrelated subjects in the Human Connectome Project. Results demonstrate that predicted signals faithfully reproduce ground-truth BOLD contrast and inter-voxel dynamic coupling patterns, achieving high-fidelity spatiotemporal reconstruction at the native 1.6 mm³ isotropic resolution. This work establishes a novel paradigm for accelerating fMRI acquisition, enabling real-time dynamic brain–computer interfaces, and probing intrinsic neurodynamics.

Technology Category

Application Category

📝 Abstract
Understanding brain dynamics is important for neuroscience and mental health. Functional magnetic resonance imaging (fMRI) enables the measurement of neural activities through blood-oxygen-level-dependent (BOLD) signals, which represent brain states. In this study, we aim to predict future human resting brain states with fMRI. Due to the 3D voxel-wise spatial organization and temporal dependencies of the fMRI data, we propose a novel architecture which employs a 4D Shifted Window (Swin) Transformer as encoder to efficiently learn spatio-temporal information and a convolutional decoder to enable brain state prediction at the same spatial and temporal resolution as the input fMRI data. We used 100 unrelated subjects from the Human Connectome Project (HCP) for model training and testing. Our novel model has shown high accuracy when predicting 7.2s resting-state brain activities based on the prior 23.04s fMRI time series. The predicted brain states highly resemble BOLD contrast and dynamics. This work shows promising evidence that the spatiotemporal organization of the human brain can be learned by a Swin Transformer model, at high resolution, which provides a potential for reducing the fMRI scan time and the development of brain-computer interfaces in the future.
Problem

Research questions and friction points this paper is trying to address.

Predict future human resting brain states using fMRI
Learn spatio-temporal fMRI data with Swin Transformer
Reduce fMRI scan time and improve brain-computer interfaces
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses 4D Swin Transformer for spatio-temporal learning
Combines convolutional decoder for high-resolution prediction
Predicts brain states from prior fMRI time series
🔎 Similar Papers
No similar papers found.
Y
Yifei Sun
School of Biomedical Engineering and Brain and Mind Center, University of Sydney, Sydney, NSW 2050 Australia
D
Daniel Chahine
School of Biomedical Engineering, University of Technology Sydney, Sydney, NSW 2007 Australia
Q
Qinghao Wen
School of Aerospace, Mechanical and Mechatronic Engineering, University of Sydney, Sydney, NSW 2050 Australia
Tianming Liu
Tianming Liu
Distinguished Research Professor of Computer Science, University of Georgia
BrainBrain-Inspired AILLMArtificial General IntelligenceQuantum AI
X
Xiang Li
Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
Yixuan Yuan
Yixuan Yuan
Associate Professor in Chinese University of Hong Kong
Medical image analysisAI in healthcareBrain data analysisEndoscopy
Fernando Calamante
Fernando Calamante
University of Sydney; Director of Sydney Imaging, Professor at Faculty of Engineering
MRIbrain structural and functional connectivityperfusion MRIneruroimaging
Jinglei Lv
Jinglei Lv
University of Sydney
Biomedical ImagingMedical Image AnalysisNeuroimagingEEGNeuroscience