EPRBench: A High-Quality Benchmark Dataset for Event Stream Based Visual Place Recognition

📅 2026-02-13
📈 Citations: 0
Influential: 0
📄 PDF

Technology Category

Application Category

📝 Abstract
Event stream-based Visual Place Recognition (VPR) is an emerging research direction that offers a compelling solution to the instability of conventional visible-light cameras under challenging conditions such as low illumination, overexposure, and high-speed motion. Recognizing the current scarcity of dedicated datasets in this domain, we introduce EPRBench, a high-quality benchmark specifically designed for event stream-based VPR. EPRBench comprises 10K event sequences and 65K event frames, collected using both handheld and vehicle-mounted setups to comprehensively capture real-world challenges across diverse viewpoints, weather conditions, and lighting scenarios. To support semantic-aware and language-integrated VPR research, we provide LLM-generated scene descriptions, subsequently refined through human annotation, establishing a solid foundation for integrating LLMs into event-based perception pipelines. To facilitate systematic evaluation, we implement and benchmark 15 state-of-the-art VPR algorithms on EPRBench, offering a strong baseline for future algorithmic comparisons. Furthermore, we propose a novel multi-modal fusion paradigm for VPR: leveraging LLMs to generate textual scene descriptions from raw event streams, which then guide spatially attentive token selection, cross-modal feature fusion, and multi-scale representation learning. This framework not only achieves highly accurate place recognition but also produces interpretable reasoning processes alongside its predictions, significantly enhancing model transparency and explainability. The dataset and source code will be released on https://github.com/Event-AHU/Neuromorphic_ReID
Problem

Research questions and friction points this paper is trying to address.

Event-based Vision
Visual Place Recognition
Benchmark Dataset
Neuromorphic Sensing
Event Stream
Innovation

Methods, ideas, or system contributions that make the work stand out.

event-based visual place recognition
benchmark dataset
large language models
multimodal fusion
explainable AI
🔎 Similar Papers
No similar papers found.
X
Xiao Wang
School of Computer Science and Technology, Anhui University, Hefei, China
X
Xingxing Xiong
School of Computer Science and Technology, Anhui University, Hefei, China
J
Jinfeng Gao
School of Computer Science and Technology, Anhui University, Hefei, China
X
Xufeng Lou
School of Computer Science and Technology, Anhui University, Hefei, China
Bo Jiang
Bo Jiang
Anhui University
Computer Vision and Pattern Recognition
S
Si-bao Chen
School of Computer Science and Technology, Anhui University, Hefei, China
Yaowei Wang
Yaowei Wang
The Hong Kong Polytechnic University
Y
Yonghong Tian
Peng Cheng Laboratory, Shenzhen, China; School of Computer Science, Peking University, China; Shenzhen Graduate School, Peking University, China