Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image Classification

📅 2025-06-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Few-shot image classification suffers from insufficient robustness against non-affine deformations and impulse noise, alongside uncertain class linear separability. Method: This paper proposes the normalized Radon Cumulative Distribution Transform (R-CDT), integrating Radon transform, cumulative distribution functions, and optimal transport theory. It introduces two novel normalization strategies: max-normalization—preserving strict linear separability under affine transformations—and mean-normalization—enhancing robustness to local non-affine deformations and impulse noise. Theoretical analysis establishes intrinsic connections between R-CDT and both Wasserstein-∞ and Wasserstein-2 distances. Results: Experiments demonstrate that R-CDT achieves high classification accuracy in few-shot settings and exhibits superior invariance and robustness under measurement distortions, notably in watermark recognition tasks.

Technology Category

Application Category

📝 Abstract
The Radon cumulative distribution transform (R-CDT), is an easy-to-compute feature extractor that facilitates image classification tasks especially in the small data regime. It is closely related to the sliced Wasserstein distance and provably guaranties the linear separability of image classes that emerge from translations or scalings. In many real-world applications, like the recognition of watermarks in filigranology, however, the data is subject to general affine transformations originating from the measurement process. To overcome this issue, we recently introduced the so-called max-normalized R-CDT that only requires elementary operations and guaranties the separability under arbitrary affine transformations. The aim of this paper is to continue our study of the max-normalized R-CDT especially with respect to its robustness against non-affine image deformations. Our sensitivity analysis shows that its separability properties are stable provided the Wasserstein-infinity distance between the samples can be controlled. Since the Wasserstein-infinity distance only allows small local image deformations, we moreover introduce a mean-normalized version of the R-CDT. In this case, robustness relates to the Wasserstein-2 distance and also covers image deformations caused by impulsive noise for instance. Our theoretical results are supported by numerical experiments showing the effectiveness of our novel feature extractors as well as their robustness against local non-affine deformations and impulsive noise.
Problem

Research questions and friction points this paper is trying to address.

Enhancing image classification robustness under affine transformations
Improving separability against non-affine image deformations
Developing normalized R-CDT versions for noise resilience
Innovation

Methods, ideas, or system contributions that make the work stand out.

Max-normalized R-CDT for affine invariance
Mean-normalized R-CDT for noise robustness
Wasserstein distance-based separability analysis
M
Matthias Beckmann
Center for Industrial Mathematics, University of Bremen, Germany & Department of Electrical and Electronic Engineering, Imperial College London, UK
R
Robert Beinert
Institut für Mathematik, Technische Universität Berlin, Germany
Jonas Bresch
Jonas Bresch
Wissenschaftlicher Mitarbeiter, TU Berlin
Image AnalysisOptimal TransportStochastic Linear Algebra