Robust and Flexible Omnidirectional Depth Estimation with Multiple 360-degree Cameras

📅 2024-09-23
📈 Citations: 0
Influential: 0
📄 PDF

career value

228K/year
🤖 AI Summary
Depth estimation in multi-360° camera surround-view systems suffers from poor robustness and generalization due to lens contamination and variable camera layouts. Method: We propose a geometry-aware spherical depth estimation framework. It introduces the generalized spherical epipolar isometric rectangular projection to simplify spherical matching constraints; designs a two-stage fusion with one-stage spherical sweeping for joint depth estimation; and employs spherical feature extraction and spherical sweeping to fuse multi-view redundant information. Contribution/Results: We construct the first realistic 360° synthetic dataset featuring physically plausible lens contamination and glare (12K panoramas + 3K ground-truth depth maps). Our method achieves state-of-the-art accuracy under contaminated inputs, supports arbitrary numbers and configurations of 360° cameras, and significantly outperforms existing approaches on our dataset and multiple public benchmarks.

Technology Category

Application Category

📝 Abstract
Omnidirectional depth estimation has received much attention from researchers in recent years. However, challenges arise due to camera soiling and variations in camera layouts, affecting the robustness and flexibility of the algorithm. In this paper, we use the geometric constraints and redundant information of multiple 360-degree cameras to achieve robust and flexible multi-view omnidirectional depth estimation. We implement two algorithms, in which the two-stage algorithm obtains initial depth maps by pairwise stereo matching of multiple cameras and fuses the multiple depth maps to achieve the final depth estimation; the one-stage algorithm adopts spherical sweeping based on hypothetical depths to construct a uniform spherical matching cost of the multi-camera images and obtain the depth. Additionally, a generalized epipolar equirectangular projection is introduced to simplify the spherical epipolar constraints. To overcome panorama distortion, a spherical feature extractor is implemented. Furthermore, a synthetic 360-degree dataset consisting of 12K road scene panoramas and 3K ground truth depth maps is presented to train and evaluate 360-degree depth estimation algorithms. Our dataset takes soiled camera lenses and glare into consideration, which is more consistent with the real-world environment. Experiments show that our two algorithms achieve state-of-the-art performance, accurately predicting depth maps even when provided with soiled panorama inputs. The flexibility of the algorithms is experimentally validated in terms of camera layouts and numbers.
Problem

Research questions and friction points this paper is trying to address.

Robust omnidirectional depth estimation
Flexible multi-camera depth algorithms
Overcoming camera soiling and layout variations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multiple 360-degree cameras
Geometric constraints utilization
Spherical feature extraction
M
Ming Li
School of Electronic Science and Engineering, Nanjing University, Nanjing, China; School of Artificial Intelligence, Nanjing University of Information Science and Technology, Nanjing, China
X
Xuejiao Hu
School of Computer Engineering, Jinling Institute of Technology, Nanjing, China
X
Xueqian Jin
School of Electronic Science and Engineering, Nanjing University, Nanjing, China
J
Jinghao Cao
School of Electronic Science and Engineering, Nanjing University, Nanjing, China
Sidan Du
Sidan Du
Nanjing University
Image Processing and ControlMachine Learning
Y
Yang Li
School of Electronic Science and Engineering, Nanjing University, Nanjing, China; Suzhou High Technology Research Institute, Nanjing University, Suzhou, China