Rethinking Unsupervised Cross-modal Flow Estimation: Learning from Decoupled Optimization and Consistency Constraint

📅 2025-09-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing the challenges of large modality gaps and severe geometric misalignment in unsupervised cross-modal optical flow estimation, this paper proposes DCFlow. Methodologically, it introduces (1) a decoupled optimization strategy that separates modality translation from flow estimation, incorporating task-specific supervision; (2) a geometry-aware data synthesis pipeline coupled with an outlier-robust loss to enable reliable motion supervision without ground-truth flow labels; and (3) a cross-modal consistency constraint that jointly optimizes dual networks to enhance inter-modal geometric alignment. Evaluated on a newly constructed comprehensive cross-modal optical flow benchmark, DCFlow is compatible with various optical flow backbones and consistently outperforms existing unsupervised methods, achieving state-of-the-art performance.

Technology Category

Application Category

📝 Abstract
This work presents DCFlow, a novel unsupervised cross-modal flow estimation framework that integrates a decoupled optimization strategy and a cross-modal consistency constraint. Unlike previous approaches that implicitly learn flow estimation solely from appearance similarity, we introduce a decoupled optimization strategy with task-specific supervision to address modality discrepancy and geometric misalignment distinctly. This is achieved by collaboratively training a modality transfer network and a flow estimation network. To enable reliable motion supervision without ground-truth flow, we propose a geometry-aware data synthesis pipeline combined with an outlier-robust loss. Additionally, we introduce a cross-modal consistency constraint to jointly optimize both networks, significantly improving flow prediction accuracy. For evaluation, we construct a comprehensive cross-modal flow benchmark by repurposing public datasets. Experimental results demonstrate that DCFlow can be integrated with various flow estimation networks and achieves state-of-the-art performance among unsupervised approaches.
Problem

Research questions and friction points this paper is trying to address.

Unsupervised cross-modal flow estimation with decoupled optimization
Addressing modality discrepancy and geometric misalignment issues
Learning reliable motion supervision without ground-truth flow
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decoupled optimization strategy with task-specific supervision
Geometry-aware data synthesis with outlier-robust loss
Cross-modal consistency constraint for joint network optimization
🔎 Similar Papers
No similar papers found.
R
Runmin Zhang
College of Information Science and Electronic Engineering, Zhejiang University
Jialiang Wang
Jialiang Wang
Research Scientist, Meta AI
Computer VisionGenerative AI
Si-Yuan Cao
Si-Yuan Cao
Zhejiang University
image alignmenthomography estimationimage fusionplace recognition
Z
Zhu Yu
College of Information Science and Electronic Engineering, Zhejiang University
J
Junchen Yu
College of Information Science and Electronic Engineering, Zhejiang University
G
Guangyi Zhang
College of Information Science and Electronic Engineering, Zhejiang University
H
Hui-Liang Shen
College of Information Science and Electronic Engineering, Zhejiang University