🤖 AI Summary
This work challenges the prevailing assumption that noise-correction methods fail solely because the noise transition matrix is estimated inaccurately. By introducing an oracle setting in which the true noise transition matrix is provided, the study demonstrates that performance collapse persists even under these ideal conditions. To diagnose the underlying failure mechanisms systematically, the authors propose a tripartite analytical framework that integrates macroscopic convergence behavior, microscopic optimization dynamics, and information-theoretic learnability. Theory and experiments together reveal that the root cause lies in deeper structural issues rather than in estimation error alone, offering a new perspective for designing robust methods for learning with noisy labels.
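For reference, the noise transition matrix discussed above is, under the row-stochastic convention standard in the LNL literature (the paper's own convention may differ), defined entrywise for a $C$-class problem as

$$T_{ij} = \Pr\big(\tilde{Y} = j \mid Y = i\big),$$

so the noisy-label posterior follows from the clean one via $p(\tilde{y} \mid x) = T^\top p(y \mid x)$; the oracle setting hands the learner this exact $T$ rather than an estimate of it.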
📝 Abstract
Statistically consistent methods based on the noise transition matrix ($T$) offer a theoretically grounded solution to Learning with Noisy Labels (LNL), with guarantees of convergence to the optimal clean-data classifier. In practice, however, these methods are often outperformed by empirical approaches such as sample selection, and this gap is usually attributed to the difficulty of accurately estimating $T$. The common assumption is that, given a perfect $T$, noise-correction methods would recover their theoretical advantage. In this work, we put this longstanding hypothesis to a decisive test. We conduct experiments under idealized conditions, providing correction methods with a perfect, oracle transition matrix. Even then, the corrected methods still suffer from performance collapse during training. This demonstrates that the failure is not fundamentally a $T$-estimation problem, but stems from a more deeply rooted flaw. To explain this behavior, we provide a unified analysis that links three levels: macroscopic convergence states, microscopic optimization dynamics, and information-theoretic limits on what can be learned from noisy labels. Together, these results give a formal account of why ideal noise correction fails and offer concrete guidance for designing more reliable methods for learning with noisy labels.
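To make the setting concrete, below is a minimal sketch of forward loss correction, one standard statistically consistent $T$-based method, paired with an oracle $T$ for symmetric label noise. This is an illustration under assumed conventions (PyTorch, row-stochastic $T$, the hypothetical helper name `forward_corrected_loss`), not the paper's exact implementation.

```python
import torch
import torch.nn.functional as F

def forward_corrected_loss(logits: torch.Tensor,
                           noisy_labels: torch.Tensor,
                           T: torch.Tensor) -> torch.Tensor:
    """Forward loss correction: fit the observed noisy labels by pushing
    the model's clean-class posterior through the transition matrix T.

    logits:       (batch, C) raw model scores over the C clean classes
    noisy_labels: (batch,)   observed, possibly corrupted labels
    T:            (C, C)     row-stochastic, T[i, j] = P(noisy = j | clean = i)
    """
    clean_posterior = F.softmax(logits, dim=1)   # model's estimate of P(y | x)
    noisy_posterior = clean_posterior @ T        # implied P(y_tilde | x)
    log_probs = noisy_posterior.clamp_min(1e-12).log()
    return F.nll_loss(log_probs, noisy_labels)

# Oracle T for symmetric label noise at rate eta over C classes,
# mirroring the idealized conditions tested in the paper:
C, eta = 10, 0.4
T = torch.full((C, C), eta / (C - 1))
T.fill_diagonal_(1.0 - eta)

# Example: loss on a random batch (stand-in for a real training step).
logits = torch.randn(32, C)
noisy_labels = torch.randint(0, C, (32,))
loss = forward_corrected_loss(logits, noisy_labels, T)
```

When $T$ is invertible, minimizing this corrected loss in expectation over the noisy distribution yields the same optimal classifier as clean cross-entropy would, which is precisely why the collapse observed even with an oracle $T$ is striking.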