ClearGCD: Mitigating Shortcut Learning For Robust Generalized Category Discovery

📅 2025-11-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Open-world generalized category discovery (GCD) suffers from prototype confusion caused by shortcut learning, leading to catastrophic forgetting of known classes and degraded discriminability for novel classes. To address this, we propose a semantic consistency enhancement and dynamic prototype alignment framework. Specifically, we design a semantic view alignment mechanism: weak augmentations preserve semantic stability, while strong augmentations—generated via cross-class image patch replacement—suppress non-semantic shortcuts. We further introduce shortcut-suppressing regularization and an adaptive prototype memory bank to support contrastive learning. Our method consistently outperforms state-of-the-art approaches across multiple benchmarks. It effectively retains knowledge of known classes while significantly improving separation of novel categories. The framework demonstrates strong generalization and robustness, and is plug-and-play compatible with various parametric GCD architectures.

Technology Category

Application Category

📝 Abstract
In open-world scenarios, Generalized Category Discovery (GCD) requires identifying both known and novel categories within unlabeled data. However, existing methods often suffer from prototype confusion caused by shortcut learning, which undermines generalization and leads to forgetting of known classes. We propose ClearGCD, a framework designed to mitigate reliance on non-semantic cues through two complementary mechanisms. First, Semantic View Alignment (SVA) generates strong augmentations via cross-class patch replacement and enforces semantic consistency using weak augmentations. Second, Shortcut Suppression Regularization (SSR) maintains an adaptive prototype bank that aligns known classes while encouraging separation of potential novel ones. ClearGCD can be seamlessly integrated into parametric GCD approaches and consistently outperforms state-of-the-art methods across multiple benchmarks.
Problem

Research questions and friction points this paper is trying to address.

Mitigates shortcut learning in category discovery
Reduces prototype confusion for better generalization
Enhances identification of known and novel classes
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-class patch replacement for semantic view alignment
Adaptive prototype bank for shortcut suppression regularization
Seamless integration into parametric GCD frameworks
🔎 Similar Papers
No similar papers found.
K
Kailin Lyu
Institute of Automation, Chinese Academy of Sciences, Beijing, China
J
Jianwei He
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Long Xiao
Long Xiao
University of Cambridge, Engineering Department, Cavendish Laboratory
GraphenePhotonicsTerahertzCommunication System
J
Jianing Zeng
Institute of Automation, Chinese Academy of Sciences, Beijing, China
L
Liang Fan
Loughborough University, Loughborough, United Kingdom
L
Lin Shu
Institute of Automation, Chinese Academy of Sciences, Beijing, China
J
Jie Hao
Institute of Automation, Chinese Academy of Sciences, Beijing, China