FocalCount: Towards Class-Count Imbalance in Class-Agnostic Counting

📅 2025-02-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In category-agnostic object counting, dominant single-category images and mean squared error (MSE) loss induce severe undercounting bias for rare categories. To address this, we propose FocalCount: first, a category diversity feature estimator serves as a dynamic weighting factor to mitigate single-category dominance; second, a Focal-MSE hybrid loss function jointly optimizes MSE and binary cross-entropy (BCE), enhancing sensitivity to rare-category errors; third, multi-attribute feature fusion enables fine-grained per-category count estimation. Experiments across three counting benchmarks demonstrate substantial improvements in category-level accuracy—particularly in few-shot and zero-shot settings—while preserving overall count accuracy and significantly enhancing robustness to skewed category distributions. To our knowledge, this is the first work to jointly model category diversity and design adaptive loss functions for category-agnostic counting.

Technology Category

Application Category

📝 Abstract
In class-agnostic object counting, the goal is to estimate the total number of object instances in an image without distinguishing between specific categories. Existing methods often predict this count without considering class-specific outputs, leading to inaccuracies when such outputs are required. These inaccuracies stem from two key challenges: 1) the prevalence of single-category images in datasets, which leads models to generalize specific categories as representative of all objects, and 2) the use of mean squared error loss during training, which applies uniform penalization. This uniform penalty disregards errors in less frequent categories, particularly when these errors contribute minimally to the overall loss. To address these issues, we propose {FocalCount}, a novel approach that leverages diverse feature attributes to estimate the number of object categories in an image. This estimate serves as a weighted factor to correct class-count imbalances. Additionally, we introduce {Focal-MSE}, a new loss function that integrates binary cross-entropy to generate stronger error gradients, enhancing the model's sensitivity to errors in underrepresented categories. Our approach significantly improves the model's ability to distinguish between specific classes and general counts, demonstrating superior performance and scalability in both few-shot and zero-shot scenarios across three object counting datasets. The code will be released soon.
Problem

Research questions and friction points this paper is trying to address.

Class-agnostic object counting
Class-count imbalance
Underrepresented categories inaccuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Leverages diverse feature attributes
Introduces Focal-MSE loss function
Corrects class-count imbalances effectively
🔎 Similar Papers
No similar papers found.
H
Huilin Zhu
Hubei Key Laboratory of Transportation Internet of Things, Wuhan University of Technology; School of Computing and Information Systems, Singapore Management University
J
Jingling Yuan
Hubei Key Laboratory of Transportation Internet of Things, Wuhan University of Technology
Zhengwei Yang
Zhengwei Yang
Wuhan University | A*STAR
Computer VisionCausal InferenceRe-identification
Y
Yu Guo
School of Navigation, Wuhan University of Technology; School of Computing and Information Systems, Singapore Management University
X
Xian Zhong
Hubei Key Laboratory of Transportation Internet of Things, Wuhan University of Technology
Shengfeng He
Shengfeng He
Singapore Management University
Visual ComputingGenerative ModelsComputer VisionComputational PhotographyComputer Graphics