🤖 AI Summary
This study addresses the overreliance of existing wildlife re-identification models on non-discriminative cues such as background or silhouette, which inflates performance metrics while compromising biological plausibility. Focusing on jaguar re-identification, the work proposes the first diagnostic framework that explicitly disentangles foreground and background while incorporating mirror symmetry constraints. To support rigorous evaluation, the authors introduce the Pantanal Jaguar Benchmark, a novel dataset featuring pixel-level masks and an identity-balanced protocol. Systematic experiments are conducted using inpainted images to isolate foreground content, combined with strategies including ArcFace fine-tuning, antisymmetric regularization, and Lorentzian hyperbolic embeddings. The results reveal a critical dependence of current methods on background leakage and demonstrate that the proposed approach effectively enhances both the discriminability of spot-pattern features and biological validity.
📝 Abstract
Jaguar re-identification (re-ID) from citizen-science imagery can look strong on standard retrieval metrics while still relying on the wrong evidence, such as background context or silhouette shape, instead of the coat pattern that defines identity. We introduce a diagnostic framework for wildlife re-ID with two axes: a leakage-controlled context ratio, background/foreground, computed from inpainted background-only versus foreground-only images, and a laterality diagnostic based on cross-flank retrieval and mirror self-similarity. To make these diagnostics measurable, we curate a Pantanal jaguar benchmark with per-pixel segmentation masks and an identity-balanced evaluation protocol. We then use representative mitigation families, ArcFace fine-tuning, anti-symmetry regularization, and Lorentz hyperbolic embeddings, as case studies under the same evaluation lens. The goal is not only to ask which model ranks best, but also what visual evidence it uses to do so.