Domain-Adaptive Pretraining Improves Primate Behavior Recognition

📅 2025-09-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the high annotation cost and low data efficiency in primate behavior recognition, this paper proposes Domain-Adaptive Pretraining (DAP), a self-supervised continual pretraining method built upon the V-JEPA architecture, trained exclusively on unlabeled primate videos. DAP significantly reduces reliance on labeled data while exhibiting strong domain transfer capability across diverse primate behavior recognition tasks. Evaluated on two benchmark datasets—PanAf and ChimpACT—DAP achieves absolute improvements of 6.1% in accuracy and 6.3% in mean Average Precision (mAP) over state-of-the-art action recognition models. This work represents the first integration of JEPA-style self-supervised learning with primate behavioral analysis, establishing a scalable and transferable general-purpose modeling framework for low-resource animal behavior research.

Technology Category

Application Category

📝 Abstract
Computer vision for animal behavior offers promising tools to aid research in ecology, cognition, and to support conservation efforts. Video camera traps allow for large-scale data collection, but high labeling costs remain a bottleneck to creating large-scale datasets. We thus need data-efficient learning approaches. In this work, we show that we can utilize self-supervised learning to considerably improve action recognition on primate behavior. On two datasets of great ape behavior (PanAf and ChimpACT), we outperform published state-of-the-art action recognition models by 6.1 %pt. accuracy and 6.3 %pt. mAP, respectively. We achieve this by utilizing a pretrained V-JEPA model and applying domain-adaptive pretraining (DAP), i.e. continuing the pretraining with in-domain data. We show that most of the performance gain stems from the DAP. Our method promises great potential for improving the recognition of animal behavior, as DAP does not require labeled samples. Code is available at https://github.com/ecker-lab/dap-behavior
Problem

Research questions and friction points this paper is trying to address.

Improving primate behavior recognition accuracy
Reducing labeling costs for animal behavior datasets
Applying self-supervised domain-adaptive pretraining techniques
Innovation

Methods, ideas, or system contributions that make the work stand out.

Domain-adaptive pretraining with in-domain data
Self-supervised V-JEPA model utilization
Unlabeled sample behavior recognition improvement
F
Felix B. Mueller
Institute of Computer Science and Campus Institute Data Science, University of Göttingen, Germany
T
Timo Lueddecke
Institute of Computer Science and Campus Institute Data Science, University of Göttingen, Germany
Richard Vogg
Richard Vogg
University of Göttingen
computer visiondeep learninganimal behavior
Alexander S. Ecker
Alexander S. Ecker
University of Göttingen, Germany
Computational NeuroscienceVisionMachine LearningComputer VisionData Science