🤖 AI Summary
Homography estimation in multi-view football videos is severely challenged by large variations in camera pose and zoom, leading to poor performance of conventional feature-based methods.
Method: We propose H-RANSAC, the first homography estimation algorithm that operates without explicit feature extraction or point correspondence matching. Its core innovations include: (1) a geometric pre-filtering criterion grounded in camera cheirality, which effectively exploits concave quadrilateral configurations traditionally discarded by RANSAC; and (2) a posterior validation mechanism coupled with a probabilistic, adaptive iteration termination rule, with theoretical derivation of the maximum required iterations.
Results: Evaluated on a real-world 12-camera football video dataset, H-RANSAC achieves a 37% reduction in mean reprojection error and raises success rate to 92.4%, significantly outperforming state-of-the-art methods. The implementation is publicly available.
📝 Abstract
Estimating the homography matrix between images captured under radically different camera poses and zoom factors is a complex challenge. Traditional methods rely on the Random Sample Consensus (RANSAC) algorithm, which requires pairs of homologous points, pre-matched based on local image feature vectors. Sampling consensus is a core step in many Artificial Intelligence (AI) algorithms that enable computer systems to recognize patterns in data. In this paper, we propose H-RANSAC, an algorithm for homography estimation that eliminates the need for feature vectors or explicit point pairing, while it optionally supports point labeling into two classes. H-RANSAC introduces a novel geometric (cheiral) criterion that intelligently rejects implausible point configurations at the beginning of each iteration, while leveraging concave quadrilaterals typically discarded by similar algorithms. A post-hoc criterion at the end of each iteration improves accuracy further. Analytical derivations of the expected maximum iterations are provided, considering success probabilities and outlier rates, enabling adaptive performance tuning. The algorithm is validated on a demanding task: estimating homography between video frames of football matches captured by 12 cameras with highly divergent viewpoints. Results show that H-RANSAC significantly outperforms state-of-the-art classical methods, combined with deep learning-based salient point detection, in terms of average reprojection error and success rates. The relevant implementation is available in https://github.com/gnousias/H-RANSAC