How Label Imbalance Shapes Geometry: A General Spectral Analysis of Multi-Label Neural Collapse

📅 2026-05-03
📈 Citations: 0
✹ Influential: 0
📄 PDF

career value

219K/year
đŸ€– AI Summary
This work addresses the challenge of characterizing terminal-state feature geometry in multi-label classification, where label imbalance and inter-label correlations induce geometric distortions that defy conventional neural collapse theory. The authors propose a spectral control framework that leverages the spectrum Îșₘ of the label covariance matrix to establish a fundamental lower bound on the terminal geometry dictated by the label distribution. They demonstrate that this spectral bound governs geometric stability and reveal that the commonly assumed class-mean orthogonality is merely a special case. Drawing upon spectral analysis, second-moment decomposition, and covariance spectrum theory, they derive a label-frequency-weighted prototype synthesis rule. Numerical experiments confirm the tightness of the proposed geometric bound, thereby establishing a unified neural collapse theory applicable to complex, imbalanced multi-label settings.
📝 Abstract
This work investigates the phenomenon of Neural Collapse (NC) in multi-label classification, extending its conceptual framework from multi-class learning to general correlated and imbalanced multi-label settings. Although recent studies have identified a ''tag-wise averaging'' structure for multi-label features, this view relies on implicit assumptions of label balance and combinatorial symmetry. Consequently, it fails to account for the geometrical distortions caused by intrinsic label correlations and data imbalance, which are common in practice. We resolve the multiplicity-one imbalance conjecture raised by Li et al. (2024), showing that higher-multiplicity prototypes obey a class-frequency-weighted synthesis rule rather than uniform averaging. To address this, we propose a rigorous spectral-control framework to analyze the terminal phase of multi-label learning under general imbalanced conditions. We introduce the label covariance spectrum $Îș_m$, a scalar controlling the distribution-dependent lower-bound geometry, derived from the second-order moment matrix of the label distribution. Contrary to the averaging perspective, our analysis reveals that the centered label covariance spectrum controls the stability of terminal geometry by quantifying the weakest centered inter-class contrast directions. We prove that the classical Tag-wise Averaging emerges only as a special case under perfect orthogonality. Numerical experiments on synthetic distributions validate our theoretical bounds. This work resolves the scaled-average aspect of the imbalance conjecture and establishes a unifying theoretical framework that extends Neural Collapse to complex, imbalanced multi-label settings.
Problem

Research questions and friction points this paper is trying to address.

Neural Collapse
multi-label classification
label imbalance
label correlation
feature geometry
Innovation

Methods, ideas, or system contributions that make the work stand out.

Neural Collapse
multi-label classification
label imbalance
spectral analysis
label covariance spectrum
🔎 Similar Papers
2024-05-09Computer Vision and Pattern RecognitionCitations: 7