Exploring Partial Multi-Label Learning via Integrating Semantic Co-occurrence Knowledge

📅 2025-07-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses multi-label learning under incomplete supervision—where labels may be correct, incorrect, or unknown. To model the ambiguous and complex dependencies between instances and labels, we propose the Semantic Co-occurrence Insight Network (SCINet). SCINet integrates a dual-dominant prompting module, a cross-modal feature fusion mechanism, and an intrinsic semantic enhancement strategy. It jointly leverages pre-trained multimodal foundations, text–image correlation modeling, label co-occurrence pattern mining, and diverse image augmentations. Extensive experiments on four mainstream benchmark datasets demonstrate that SCINet significantly outperforms existing state-of-the-art methods in classification accuracy. Moreover, it exhibits superior robustness and discriminative capability under noisy and missing-label conditions, validating its effectiveness in handling real-world weak supervision scenarios.

Technology Category

Application Category

📝 Abstract
Partial multi-label learning aims to extract knowledge from incompletely annotated data, which includes known correct labels, known incorrect labels, and unknown labels. The core challenge lies in accurately identifying the ambiguous relationships between labels and instances. In this paper, we emphasize that matching co-occurrence patterns between labels and instances is key to addressing this challenge. To this end, we propose Semantic Co-occurrence Insight Network (SCINet), a novel and effective framework for partial multi-label learning. Specifically, SCINet introduces a bi-dominant prompter module, which leverages an off-the-shelf multimodal model to capture text-image correlations and enhance semantic alignment. To reinforce instance-label interdependencies, we develop a cross-modality fusion module that jointly models inter-label correlations, inter-instance relationships, and co-occurrence patterns across instance-label assignments. Moreover, we propose an intrinsic semantic augmentation strategy that enhances the model's understanding of intrinsic data semantics by applying diverse image transformations, thereby fostering a synergistic relationship between label confidence and sample difficulty. Extensive experiments on four widely-used benchmark datasets demonstrate that SCINet surpasses state-of-the-art methods.
Problem

Research questions and friction points this paper is trying to address.

Extracting knowledge from incompletely annotated multi-label data
Identifying ambiguous label-instance relationships accurately
Matching label-instance co-occurrence patterns effectively
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bi-dominant prompter module captures text-image correlations
Cross-modality fusion models instance-label interdependencies
Intrinsic semantic augmentation enhances data semantics understanding
🔎 Similar Papers
No similar papers found.
X
Xin Wu
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, Sichuan 611756, China
Fei Teng
Fei Teng
Reader in Intelligent Energy Systems, Imperial College London
Stability-constrained OptimisationCyber-resilient System OperationData Privacy and Trading
Y
Yue Feng
School of Engineering, Swinburne University of Technology, Hawthorn, Victoria 3122, Australia; School of Electronic and Information Engineering, Wuyi University, Jiangmen, Guangdong 510006, China
Kaibo Shi
Kaibo Shi
Chengdu University
Neural NetworkTime-delay systemNonlinear DynamicsStability TheoryControl System
Z
Zhuosheng Lin
School of Electronic and Information Engineering, Wuyi University, Jiangmen, Guangdong 510006, China
J
Ji Zhang
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, Sichuan 611756, China
James Wang
James Wang
Columbia University
Columbia University