Improved Feature Generating Framework for Transductive Zero-shot Learning

📅 2024-12-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In zero-shot learning (ZSL), prior bias toward unseen classes impairs discriminative accuracy of generated features—especially when generators are imperfect—while unconditional discriminators introduce noise. To address these challenges in transductive ZSL (TZSL), we propose I-VAEGAN, a novel framework integrating variational autoencoders (VAEs) and generative adversarial networks (GANs). Our approach introduces two key innovations: pseudo-conditioned feature adversarial (PFA) learning and variational embedding regression (VER), which jointly eliminate reliance on unseen-class priors and enable semantics-driven, precise conditional generation alongside class-wise statistical modeling. I-VAEGAN unifies pseudo-conditioned discrimination, semantic regression, reconstruction-based pretraining, and conditional adversarial training. Evaluated on multiple TZSL benchmarks, it achieves state-of-the-art performance, significantly improving unseen-class classification accuracy across diverse prior settings, while demonstrating superior generalization and robustness.

Technology Category

Application Category

📝 Abstract
Feature Generative Adversarial Networks have emerged as powerful generative models in producing high-quality representations of unseen classes within the scope of Zero-shot Learning (ZSL). This paper delves into the pivotal influence of unseen class priors within the framework of transductive ZSL (TZSL) and illuminates the finding that even a marginal prior bias can result in substantial accuracy declines. Our extensive analysis uncovers that this inefficacy fundamentally stems from the utilization of an unconditional unseen discriminator - a core component in existing TZSL. We further establish that the detrimental effects of this component are inevitable unless the generator perfectly fits class-specific distributions. Building on these insights, we introduce our Improved Feature Generation Framework, termed I-VAEGAN, which incorporates two novel components: Pseudo-conditional Feature Adversarial (PFA) learning and Variational Embedding Regression (VER). PFA circumvents the need for prior estimation by explicitly injecting the predicted semantics as pseudo conditions for unseen classes premised by precise semantic regression. Meanwhile, VER utilizes reconstructive pre-training to learn class statistics, obtaining better semantic regression. Our I-VAEGAN achieves state-of-the-art TZSL accuracy across various benchmarks and priors. Our code would be released upon acceptance.
Problem

Research questions and friction points this paper is trying to address.

Zero-shot Learning
Unseen Class Bias
Conditional Discriminator
Innovation

Methods, ideas, or system contributions that make the work stand out.

I-VAEGAN
Pseudo Conditional Feature Adversarial Learning
Variational Embedding Regression
🔎 Similar Papers
No similar papers found.
Z
Zihan Ye
Xi’an Jiaotong-Liverpool University; University of Liverpool; Westlake University
X
Xinyuan Ru
Xi’an Jiaotong-Liverpool University; University of Liverpool
Shiming Chen
Shiming Chen
Washington University
Photoreceptor gene expression
Y
Yaochu Jin
Westlake University
Kaizhu Huang
Kaizhu Huang
Professor, Duke Kunshan University
Generalization & RobustnessStatistical Learning ThoeryTrustworthy AI
X
Xiaobo Jin
Xi’an Jiaotong-Liverpool University