Event-based Stereo Depth Estimation: A Survey

📅 2024-09-26
🏛️ arXiv.org
📈 Citations: 3
Influential: 0
📄 PDF
🤖 AI Summary
Event-camera-based stereo depth estimation suffers from fragmented literature and a lack of systematic surveys due to its inherently interdisciplinary nature. Method: This paper provides the first comprehensive review of three decades of progress, covering both instantaneous matching and long-term SLAM-based approaches; it systematically categorizes deep learning architectures (CNNs and Transformers) and dedicated stereo event datasets, and proposes standardized protocols for reproducible benchmark construction. Contribution/Results: It introduces a unified evaluation framework to clarify technical boundaries; identifies performance–efficiency co-optimization as the central challenge; and reveals critical research gaps at the intersections of neuromorphic circuit design, classical stereo matching, deep learning, and SLAM integration. The survey serves both newcomers seeking foundational guidance and experts requiring state-of-the-art reference, thereby advancing standardization and practical deployment of event-based stereo depth estimation.

Technology Category

Application Category

📝 Abstract
Stereopsis has widespread appeal in robotics as it is the predominant way by which living beings perceive depth to navigate our 3D world. Event cameras are novel bio-inspired sensors that detect per-pixel brightness changes asynchronously, with very high temporal resolution and high dynamic range, enabling machine perception in high-speed motion and broad illumination conditions. The high temporal precision also benefits stereo matching, making disparity (depth) estimation a popular research area for event cameras ever since its inception. Over the last 30 years, the field has evolved rapidly, from low-latency, low-power circuit design to current deep learning (DL) approaches driven by the computer vision community. The bibliography is vast and difficult to navigate for non-experts due its highly interdisciplinary nature. Past surveys have addressed distinct aspects of this topic, in the context of applications, or focusing only on a specific class of techniques, but have overlooked stereo datasets. This survey provides a comprehensive overview, covering both instantaneous stereo and long-term methods suitable for simultaneous localization and mapping (SLAM), along with theoretical and empirical comparisons. It is the first to extensively review DL methods as well as stereo datasets, even providing practical suggestions for creating new benchmarks to advance the field. The main advantages and challenges faced by event-based stereo depth estimation are also discussed. Despite significant progress, challenges remain in achieving optimal performance in not only accuracy but also efficiency, a cornerstone of event-based computing. We identify several gaps and propose future research directions. We hope this survey inspires future research in this area, by serving as an accessible entry point for newcomers, as well as a practical guide for seasoned researchers in the community.
Problem

Research questions and friction points this paper is trying to address.

Surveying event-based stereo depth estimation methods and challenges
Reviewing deep learning approaches and stereo datasets comprehensively
Identifying gaps and proposing future research directions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Event cameras detect brightness changes asynchronously
Deep learning approaches drive current research
Survey reviews datasets and SLAM methods
🔎 Similar Papers
No similar papers found.
S
Suman Ghosh
TU Berlin and Robotics Institute Germany, Berlin, Germany
G
Guillermo Gallego
Science of Intelligence Excellence Cluster and Einstein Center Digital Future, Berlin, Germany