Hyperbolic Category Discovery

📅 2025-04-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Generalized Category Discovery (GCD) aims to jointly identify all unlabeled images—including those from both known and unknown classes—in an open-world setting with inherent hierarchical structure. Conventional Euclidean or spherical representations struggle to capture such intrinsic hierarchies. This work pioneers the adaptation of GCD to hyperbolic space, proposing a unified framework that jointly learns hierarchy-aware representations and classifiers in the Poincaré ball. We introduce a novel joint metric combining hyperbolic distance and angular similarity to enhance knowledge transfer from known to unknown classes. Our approach integrates Poincaré embedding, Euclidean-to-hyperbolic mapping, and self-supervised backbone fine-tuning. Extensive experiments on multiple GCD benchmarks demonstrate significant improvements over state-of-the-art methods, validating the effectiveness and superiority of hyperbolic geometry for hierarchical category discovery in open-world scenarios.

Technology Category

Application Category

📝 Abstract
Generalized Category Discovery (GCD) is an intriguing open-world problem that has garnered increasing attention. Given a dataset that includes both labelled and unlabelled images, GCD aims to categorize all images in the unlabelled subset, regardless of whether they belong to known or unknown classes. In GCD, the common practice typically involves applying a spherical projection operator at the end of the self-supervised pretrained backbone, operating within Euclidean or spherical space. However, both of these spaces have been shown to be suboptimal for encoding samples that possesses hierarchical structures. In contrast, hyperbolic space exhibits exponential volume growth relative to radius, making it inherently strong at capturing the hierarchical structure of samples from both seen and unseen categories. Therefore, we propose to tackle the category discovery challenge in the hyperbolic space. We introduce HypCD, a simple underline{Hyp}erbolic framework for learning hierarchy-aware representations and classifiers for generalized underline{C}ategory underline{D}iscovery. HypCD first transforms the Euclidean embedding space of the backbone network into hyperbolic space, facilitating subsequent representation and classification learning by considering both hyperbolic distance and the angle between samples. This approach is particularly helpful for knowledge transfer from known to unknown categories in GCD. We thoroughly evaluate HypCD on public GCD benchmarks, by applying it to various baseline and state-of-the-art methods, consistently achieving significant improvements.
Problem

Research questions and friction points this paper is trying to address.

GCD categorizes unlabeled images, including unknown classes
Euclidean and spherical spaces poorly encode hierarchical structures
Hyperbolic space better captures hierarchical sample structures
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hyperbolic space for hierarchical representation
Combines hyperbolic distance and angle
Transforms Euclidean to hyperbolic space
🔎 Similar Papers
No similar papers found.
Y
Yuanpei Liu
Visual AI Lab, The University of Hong Kong
Zhenqi He
Zhenqi He
The Hong Kong University of Science and Technology (HKUST) | The University of Hong Kong (HKU)
Open-World LearningComputer VisionMulti-Modal Learning
K
Kai Han
Visual AI Lab, The University of Hong Kong