A Multimodal Pre-trained Network for Integrated EEG-Video Seizure Detection

📅 2026-04-29
📈 Citations: 0
Influential: 0
📄 PDF

career value

249K/year
🤖 AI Summary
This work addresses the limitations of existing unimodal epileptic seizure detection methods, which are prone to interference from motion artifacts or benign behaviors and rely on time-consuming manual review of synchronized video-EEG data. The authors propose EEGVFusion, a novel framework that integrates self-supervised EEG representation learning, 3D convolutional video encoding, optimal transport-based alignment, and bidirectional cross-attention mechanisms to enable synergistic multimodal analysis of neural and behavioral signals. Evaluated on an expert-annotated mouse EEG-video dataset, the method achieves a balanced accuracy of 0.9957 and a false-positive rate of 0.6250 FP/h under random split evaluation. In the more challenging leave-one-subject-out setting, it further reduces the false-positive rate to 0.4833 FP/h while maintaining 100% event sensitivity, substantially outperforming current state-of-the-art approaches.
📝 Abstract
Reliable seizure detection in mouse models is essential for preclinical epilepsy research, yet manual review of synchronized video-EEG recordings is labor-intensive and single-modality systems fail for complementary reasons: video-based methods are easily confounded by benign behaviors, whereas EEG-based methods are vulnerable to ictal motion artifacts. We present EEGVFusion, a multimodal framework that combines self-supervised EEG representation learning, spatio-temporal video encoding, optimal-transport alignment, and bidirectional cross-attention to integrate neural and behavioral evidence. We also curate an expert-annotated dataset of synchronized EEG and video recordings comprising 93 sessions from 15 mice for training and evaluation. In the random-session split, EEGVFusion achieved a Balanced Accuracy of 0.9957 with perfect event sensitivity and an Event FAR of 0.6250 FP/h, indicating strong seizure detection performance with a low false-alarm burden. In a single held-out-subject evaluation with Subject 110 reserved for testing, EEGVFusion achieved a Balanced Accuracy of 0.9718 and reduced Event FAR from 2.7250 FP/h for the EEG-only counterpart to 0.4833 FP/h while preserving perfect event sensitivity. Targeted ablations further showed that EEG pre-training and OT alignment help reduce false alarms while preserving event sensitivity.
Problem

Research questions and friction points this paper is trying to address.

seizure detection
multimodal integration
EEG-video synchronization
motion artifacts
false alarms
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal fusion
self-supervised EEG representation
optimal transport alignment
cross-attention
seizure detection
T
Tong Lu
Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China
K
Ke Xu
Department of Neurosurgery, SanBo Brain Hospital, Capital Medical University, Beijing, 100018, China; Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China
Z
Zimo Zhang
Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China
Z
Zitong Zhao
Department of Neurosurgery, SanBo Brain Hospital, Capital Medical University, Beijing, 100018, China; Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China
D
Danwei Weng
GenAns Biotechnology Co., Ltd, Beijing, 102206, China
Ruiyu Wang
Ruiyu Wang
PhD student of EECS, KTH Royal Institute of Technology
RoboticsComputer Vision
Miao Liu
Miao Liu
Institute of Applied Ecology, Chinese Academy Sciences
Landscape EcologyUrban Ecology
Z
Zizuo Zhang
Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China
J
Jingyi Yao
Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China
Y
Yixuan Zhao
Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China
Wenchao Zhang
Wenchao Zhang
Staff Image Scientist at OmniVision
Computer Visionface recognitionsmart sensor
Min Wang
Min Wang
Beijing Academy of Quantum Information Sciences, China
physics
G
Guoming Luan
Department of Neurosurgery, SanBo Brain Hospital, Capital Medical University, Beijing, 100018, China; Laboratory for Clinical Medicine, Capital Medical University, Beijing, 100069, China; Center of Epilepsy, Beijing Institute for Brain Disorders, Sanbo Brain Hospital, Capital Medical University, Beijing, 100018, China
Minmin Luo
Minmin Luo
Chinese Institute for Brain Research, Beijing
neuroscience
Z
Zhifeng Yue
Beijing Institute for Brain Research, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 102206, China; Chinese Institute for Brain Research, Beijing, Beijing, 102206, China; Beijing Key Laboratory of Brain Science and Brain-Machine Interface, Beijing, 102206, China