RAID-Database: human Responses to Affine Image Distortions

📅 2024-12-13

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

216K/year

🤖 AI Summary

Existing image quality databases predominantly focus on digital distortions, neglecting naturally occurring affine transformations (rotation, translation, scaling) and Gaussian noise prevalent in real-world scenes. Method: We introduce the first large-scale subjective image quality database, systematically collecting over 20,000 quadruple-comparison responses from 105 observers on 864 images degraded by affine + Gaussian distortions. Perceptual scales are derived rigorously using Maximum Likelihood Difference Scaling (MLDS). Contribution/Results: We empirically validate Piéron’s law for suprathreshold affine distortions—revealing significantly elevated detection thresholds compared to classical models. Our database achieves superior performance on the Group-MAD benchmark versus mainstream alternatives. It fills a critical gap in natural distortion modeling and provides a high-quality, reproducible resource for training image quality assessment models and advancing research into visual perception mechanisms.

Technology Category

Application Category

📝 Abstract

Image quality databases are used to train models for predicting subjective human perception. However, most existing databases focus on distortions commonly found in digital media and not in natural conditions. Affine transformations are particularly relevant to study, as they are among the most commonly encountered by human observers in everyday life. This Data Descriptor presents a set of human responses to suprathreshold affine image transforms (rotation, translation, scaling) and Gaussian noise as convenient reference to compare with previously existing image quality databases. The responses were measured using well established psychophysics: the Maximum Likelihood Difference Scaling method. The set contains responses to 864 distorted images. The experiments involved 105 observers and more than 20000 comparisons of quadruples of images. The quality of the dataset is ensured because (a) it reproduces the classical Pi'eron's law, (b) it reproduces classical absolute detection thresholds, and (c) it is consistent with conventional image quality databases but improves them according to Group-MAD experiments.

Problem

Research questions and friction points this paper is trying to address.

Lack of databases for human responses to natural affine image distortions.

Need for reference data on human perception of affine transformations and noise.

Improvement of existing image quality databases using psychophysical methods.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Maximum Likelihood Difference Scaling method

Includes 864 affine transformed images

Involves 105 observers, 20000 image comparisons

🔎 Similar Papers

EmoEdit: Evoking Emotions through Image Manipulation

2024-05-21arXiv.orgCitations: 2

Make Me Happier: Evoking Emotions Through Image Diffusion Models

2024-03-13arXiv.orgCitations: 3