Long-tailed Species Recognition in the NACTI Wildlife Dataset

📅 2025-10-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
The North American Camera Trap Image (NACTI) dataset exhibits severe long-tailed class imbalance, with head classes dominating and tail classes—particularly rare wildlife species—represented by very few samples. Method: We systematically evaluate and enhance long-tailed recognition paradigms, proposing an optimized training framework integrating weighted cross-entropy (WCE) loss, sensitivity regularization, and adaptive learning rate scheduling, implemented end-to-end on the PyTorch Wildlife platform. Results: Our method achieves 99.40% Top-1 accuracy on the standard NACTI test set (+3.89 points over baseline) and, more critically, attains 52.55% accuracy on our newly constructed de-biased test set (ENA-Detection), demonstrating substantially improved cross-domain generalization. Experiments expose fundamental limitations of existing approaches under extreme domain shift. This work establishes a reproducible, robust new baseline for fine-grained, long-tailed wildlife recognition.

Technology Category

Application Category

📝 Abstract
As most''in the wild''data collections of the natural world, the North America Camera Trap Images (NACTI) dataset shows severe long-tailed class imbalance, noting that the largest'Head'class alone covers>50% of the 3.7M images in the corpus. Building on the PyTorch Wildlife model, we present a systematic study of Long-Tail Recognition methodologies for species recognition on the NACTI dataset covering experiments on various LTR loss functions plus LTR-sensitive regularisation. Our best configuration achieves 99.40% Top-1 accuracy on our NACTI test data split, substantially improving over a 95.51% baseline using standard cross-entropy with Adam. This also improves on previously reported top performance in MLWIC2 at 96.8% albeit using partly unpublished (potentially different) partitioning, optimiser, and evaluation protocols. To evaluate domain shifts (e.g. night-time captures, occlusion, motion-blur) towards other datasets we construct a Reduced-Bias Test set from the ENA-Detection dataset where our experimentally optimised long-tail enhanced model achieves leading 52.55% accuracy (up from 51.20% with WCE loss), demonstrating stronger generalisation capabilities under distribution shift. We document the consistent improvements of LTR-enhancing scheduler choices in this NACTI wildlife domain, particularly when in tandem with state-of-the-art LTR losses. We finally discuss qualitative and quantitative shortcomings that LTR methods cannot sufficiently address, including catastrophic breakdown for'Tail'classes under severe domain shift. For maximum reproducibility we publish all dataset splits, key code, and full network weights.
Problem

Research questions and friction points this paper is trying to address.

Addressing severe class imbalance in wildlife species recognition datasets
Improving recognition accuracy for long-tailed species distributions
Enhancing model generalization under domain shifts and biases
Innovation

Methods, ideas, or system contributions that make the work stand out.

Long-tail recognition loss functions for species classification
LTR-sensitive regularization to handle class imbalance
Domain shift adaptation with reduced-bias test evaluation
Z
Zehua Liu
School of Computer Science, University of Bristol, MVB Woodland Rd, BS8 1UB, Bristol, UK
Tilo Burghardt
Tilo Burghardt
University of Bristol
Animal BiometricsAI for ConservationConservation TechnologyComputer VisionImageomics