LeanStereo: A Leaner Backbone based Stereo Network

📅 2023-06-18

🏛️ IEEE International Joint Conference on Neural Network

📈 Citations: 0

✨ Influential: 0

career value

210K/year

🤖 AI Summary

To address the high computational cost and inference latency of end-to-end stereo matching models—hindering real-time deployment—this paper proposes a lightweight real-time stereo matching network. Our method introduces two key innovations: (1) a lightweight backbone architecture coupled with an attention-guided learnable cost volume, significantly reducing memory bandwidth consumption and arithmetic complexity; and (2) a novel co-optimization framework that jointly learns attention-weighted cost volumes and employs LogL1 loss, accelerating convergence while improving disparity accuracy recovery. Extensive experiments demonstrate that our approach achieves a 4× reduction in FLOPs compared to state-of-the-art methods such as ACVNet, attains real-time inference speeds of >30 FPS (9–14× faster), and maintains competitive accuracy on standard benchmarks.

Technology Category

Application Category

📝 Abstract

Recently, end-to-end deep networks based stereo matching methods, mainly because of their performance, have gained popularity. However, this improvement in performance comes at the cost of increased computational and memory bandwidth requirements, thus necessitating specialized hardware (GPUs); even then, these methods have large inference times compared to classical methods. This limits their applicability in real-world applications. Although we desire high accuracy stereo methods albeit with reasonable inference time. To this end, we propose a fast end-to-end stereo matching method. Majority of this speedup comes from integrating a leaner backbone. To recover the performance lost because of a leaner backbone, we propose to use learned attention weights based cost volume combined with LogL1 loss for stereo matching. Using LogL1 loss not only improves the overall performance of the proposed network but also leads to faster convergence. We do a detailed empirical evaluation of different design choices and show that our method requires $4 imes$ less operations and is also about 9 to $14 imes$ faster compared to the state of the art methods like ACVNet [1], LEAStereo [2] and CFNet [3] while giving comparable performance11Code: https://github.com/cogsys-tuebingen/LeanStereo

Problem

Research questions and friction points this paper is trying to address.

Reducing computational and memory requirements in stereo networks

Improving inference speed while maintaining accuracy

Balancing performance and efficiency for real-world applications

Innovation

Methods, ideas, or system contributions that make the work stand out.

Leaner backbone reduces computational requirements

Learned attention weights enhance cost volume

LogL1 loss improves performance and convergence

🔎 Similar Papers

No similar papers found.