Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation Graph

📅 2024-11-19
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address geometric structure degradation and poor model interpretability in learning-based stereo matching for safety-critical applications such as autonomous driving, this paper proposes MoCha-V2. Methodologically, it introduces (1) Motif-Correlation Graphs (MCGs), a novel graph-based mechanism that explicitly models repetitive local textures (motifs) within feature channels, enabling geometrically faithful and interpretable disparity reconstruction; and (2) a wavelet inverse transform coupled with multi-frequency feature fusion, enhancing fine-grained texture representation and disparity estimation accuracy. Evaluated on the Middlebury benchmark, MoCha-V2 achieves state-of-the-art performance at the time of publication, significantly improving matching completeness, robustness to occlusions and textureless regions, and structural interpretability. By jointly optimizing precision and transparency, MoCha-V2 establishes a new paradigm for trustworthy visual perception in safety-critical systems.

Technology Category

Application Category

📝 Abstract
Real-world applications of stereo matching, such as autonomous driving, place stringent demands on both safety and accuracy. However, learning-based stereo matching methods inherently suffer from the loss of geometric structures in certain feature channels, creating a bottleneck in achieving precise detail matching. Additionally, these methods lack interpretability due to the black-box nature of deep learning. In this paper, we propose MoCha-V2, a novel learning-based paradigm for stereo matching. MoCha-V2 introduces the Motif Correlation Graph (MCG) to capture recurring textures, which are referred to as ``motifs"within feature channels. These motifs reconstruct geometric structures and are learned in a more interpretable way. Subsequently, we integrate features from multiple frequency domains through wavelet inverse transformation. The resulting motif features are utilized to restore geometric structures in the stereo matching process. Experimental results demonstrate the effectiveness of MoCha-V2. MoCha-V2 achieved 1st place on the Middlebury benchmark at the time of its release. Code is available at https://github.com/ZYangChen/MoCha-Stereo.
Problem

Research questions and friction points this paper is trying to address.

Stereomatching
Shape Information Loss
Explainability
Innovation

Methods, ideas, or system contributions that make the work stand out.

MoCha-V2
Motif Correlation Graph (MCG)
Stereomatching
🔎 Similar Papers
No similar papers found.
Ziyang Chen
Ziyang Chen
Peking University
Quantum key distributionQuantum random number generation
Y
Yongjun Zhang
College of Computer Science, the State Key Laboratory of Public Big Data, Guizhou University, Guiyang 550025, China
W
Wenting Li
School of Information Engineering, Guizhou University of Commerce, Guiyang 550021, China
Bingshu Wang
Bingshu Wang
Fuzhou University
Deformation MechanismTextureEBSDMagnesium
Y
Yong Zhao
Key Laboratory of Integrated Microsystems, Shenzhen Graduate School, Peking University, Shenzhen 518055, China
C
C. L. Philip Chen
School of Computer Science and Engineering, South China University of Technology, Guangzhou 510641, China