Semi-supervised Concept Bottleneck Models

📅 2024-06-27
🏛️ arXiv.org
📈 Citations: 2
Influential: 0
📄 PDF

career value

230K/year
🤖 AI Summary
Existing concept bottleneck models (CBMs) heavily rely on high-quality, labor-intensive human concept annotations and frequently suffer from misalignment between concept saliency and input saliency. To address the challenge of scarce labeled data, this paper proposes a semi-supervised concept bottleneck model (SSCBM)—the first to integrate semi-supervised learning into the CBM framework. SSCBM introduces a concept-level pseudo-labeling strategy and a concept-space alignment loss to enforce consistency constraints at the concept level over unlabeled samples. By jointly optimizing on both labeled and unlabeled data, SSCBM achieves 93.19% concept accuracy and 75.51% prediction accuracy using only 20% of the labeled data, approaching fully supervised performance (96.39% / 79.82%). This significantly reduces expert annotation effort while mitigating concept–input misalignment.

Technology Category

Application Category

📝 Abstract
Concept Bottleneck Models (CBMs) have garnered increasing attention due to their ability to provide concept-based explanations for black-box deep learning models while achieving high final prediction accuracy using human-like concepts. However, the training of current CBMs heavily relies on the accuracy and richness of annotated concepts in the dataset. These concept labels are typically provided by experts, which can be costly and require significant resources and effort. Additionally, concept saliency maps frequently misalign with input saliency maps, causing concept predictions to correspond to irrelevant input features - an issue related to annotation alignment. To address these limitations, we propose a new framework called SSCBM (Semi-supervised Concept Bottleneck Model). Our SSCBM is suitable for practical situations where annotated data is scarce. By leveraging joint training on both labeled and unlabeled data and aligning the unlabeled data at the concept level, we effectively solve these issues. We proposed a strategy to generate pseudo labels and an alignment loss. Experiments demonstrate that our SSCBM is both effective and efficient. With only 20% labeled data, we achieved 93.19% (96.39% in a fully supervised setting) concept accuracy and 75.51% (79.82% in a fully supervised setting) prediction accuracy.
Problem

Research questions and friction points this paper is trying to address.

Reduces reliance on costly expert-annotated concept labels
Addresses misalignment between concept and input saliency maps
Improves performance with limited labeled data using semi-supervised learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semi-supervised learning with labeled and unlabeled data
Concept-level alignment for improved accuracy
Pseudo label generation and alignment loss strategy
🔎 Similar Papers
No similar papers found.
Lijie Hu
Lijie Hu
Assistant Professor, MBZUAI
Explainable AILLMDifferential Privacy
T
Tianhao Huang
Provable Responsible AI and Data Analytics (PRADA) Lab, KAUST, Nankai University
H
Huanyi Xie
Provable Responsible AI and Data Analytics (PRADA) Lab, KAUST, Harbin Institute of Technology
C
Chenyang Ren
Provable Responsible AI and Data Analytics (PRADA) Lab, KAUST, Shanghai Jiao Tong University
Z
Zhengyu Hu
Provable Responsible AI and Data Analytics (PRADA) Lab, KAUST, HKUST
L
Lu Yu
Ant Group
D
Di Wang
Provable Responsible AI and Data Analytics (PRADA) Lab, KAUST