Interpretable Zero-shot Learning with Infinite Class Concepts

📅 2025-05-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Zero-shot learning (ZSL) suffers from two critical challenges: semantic unreliability—stemming from LLM hallucinations that produce non-visual, implausible concepts—and decision opacity. To address these, we propose a dynamic phrase-level visual concept generation framework. Our approach is the first to model class semantics as an infinite, interpretable, and image-groundable set of descriptive phrases. We introduce an entropy-driven “quality” filtering mechanism to suppress hallucinations while preserving concept transferability and discriminability. The framework integrates LLM-based dynamic phrase generation, entropy-weighted scoring, cross-modal alignment training, and visualization-enabled interpretability analysis. Evaluated on three standard ZSL benchmarks, our method achieves significant accuracy improvements over state-of-the-art methods. Crucially, it generates highly interpretable, visually grounded class concepts, enabling human-traceable, transparent reasoning—thereby bridging the gap between generative semantics and reliable visual recognition.

Technology Category

Application Category

📝 Abstract
Zero-shot learning (ZSL) aims to recognize unseen classes by aligning images with intermediate class semantics, like human-annotated concepts or class definitions. An emerging alternative leverages Large-scale Language Models (LLMs) to automatically generate class documents. However, these methods often face challenges with transparency in the classification process and may suffer from the notorious hallucination problem in LLMs, resulting in non-visual class semantics. This paper redefines class semantics in ZSL with a focus on transferability and discriminability, introducing a novel framework called Zero-shot Learning with Infinite Class Concepts (InfZSL). Our approach leverages the powerful capabilities of LLMs to dynamically generate an unlimited array of phrase-level class concepts. To address the hallucination challenge, we introduce an entropy-based scoring process that incorporates a ``goodness"concept selection mechanism, ensuring that only the most transferable and discriminative concepts are selected. Our InfZSL framework not only demonstrates significant improvements on three popular benchmark datasets but also generates highly interpretable, image-grounded concepts. Code will be released upon acceptance.
Problem

Research questions and friction points this paper is trying to address.

Enhancing transparency in zero-shot learning classification process
Addressing hallucination issues in LLM-generated class semantics
Improving transferability and discriminability of class concepts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic generation of infinite class concepts using LLMs
Entropy-based scoring for hallucination-free concept selection
Image-grounded interpretable ZSL framework with transferability focus
🔎 Similar Papers
No similar papers found.
Z
Zihan Ye
Xian Jiaotong-Liverpool University
S
Shreyank N. Gowda
University of Nottingham
Shiming Chen
Shiming Chen
Washington University
Photoreceptor gene expression
Y
Yaochu Jin
Westlake University
Kaizhu Huang
Kaizhu Huang
Professor, Duke Kunshan University
Generalization & RobustnessStatistical Learning ThoeryTrustworthy AI
X
Xiaobo Jin
Xian Jiaotong-Liverpool University