Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictions

๐Ÿ“… 2026-05-08
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

222K/year
๐Ÿค– AI Summary
This work addresses the gradient imbalance and learning instability caused by moving objects in self-supervised joint training of depth, odometry, and optical flow networks. To mitigate these issues, the authors propose CoopNet, a framework that enforces consistency between depthโ€“odometry and optical flow reconstructions to automatically identify and exclude dynamic regions. Furthermore, a hybrid loss function is designed based on the distribution of photometric reconstruction errors, enabling dynamic reweighting of multi-task gradients to promote balanced optimization. Experiments on the KITTI and Cityscapes datasets demonstrate that CoopNet significantly improves fairness and performance in multi-task co-optimization, achieving depth, odometry, and optical flow estimation results that match or surpass current state-of-the-art methods.
๐Ÿ“ Abstract
We present CoopNet, an approach that improves the cooperation of co-trained networks by dynamically adapting the apportionment of gradient, to ensure equitable learning progress. It is applied to motion-aware self-supervised prediction of depth maps, by introducing a new hybrid loss, based on a distribution model of photo-metric reconstruction errors made by, on the one hand the depth + odometry paired networks, and on the other hand the optical flow network. This model essentially assumes that the pixels from moving objects (that must be discarded for training depth and odometry), correspond to those where the two reconstructions strongly disagree. We justify this model by theoretical considerations and experimental evidences. A comparative evaluation on KITTI and CityScapes datasets shows that CoopNet improves or is comparable to the state-of-the-art in depth, odometry and optical flow predictions.
Problem

Research questions and friction points this paper is trying to address.

self-supervised learning
depth prediction
visual odometry
optical flow
co-training
Innovation

Methods, ideas, or system contributions that make the work stand out.

gradient rebalancing
self-supervised co-training
hybrid loss
photometric reconstruction error
motion-aware depth estimation
๐Ÿ”Ž Similar Papers
No similar papers found.