Redefining Quality Criteria and Distance-Aware Score Modeling for Image Editing Assessment

πŸ“… 2026-04-13
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

177K/year
πŸ€– AI Summary
Existing image editing quality assessment methods rely on handcrafted heuristic prompts, which struggle to generalize across diverse editing effects and fail to account for the continuity of the scoring space. To address these limitations, this work proposes DS-IEQA, a unified framework that adaptively learns evaluation criteria through a feedback-driven prompt optimization mechanism (FDMPO) and models the continuous structure of quality scores via a token-disentangled distance regression loss (TDRL). Leveraging a multimodal large language model, the proposed method achieves competitive performance without requiring additional training data, securing fourth place in Track 2 of the NTIRE 2026 X-AIGC Quality Assessment Challenge. This result demonstrates the framework’s effectiveness and strong generalization capability.

Technology Category

Application Category

πŸ“ Abstract
Recent advances in image editing have heightened the need for reliable Image Editing Quality Assessment (IEQA). Unlike traditional methods, IEQA requires complex reasoning over multimodal inputs and multi-dimensional assessments. Existing MLLM-based approaches often rely on human heuristic prompting, leading to two key limitations: rigid metric prompting and distance-agnostic score modeling. These issues hinder alignment with implicit human criteria and fail to capture the continuous structure of score spaces. To address this, we propose Define-and-Score Image Editing Quality Assessment (DS-IEQA), a unified framework that jointly learns evaluation criteria and score representations. Specifically, we introduce Feedback-Driven Metric Prompt Optimization (FDMPO) to automatically refine metric definitions via probabilistic feedback. Furthermore, we propose Token-Decoupled Distance Regression Loss (TDRL), which decouples numerical tokens from language modeling to explicitly model score continuity through expected distance minimization. Extensive experiments show our method's superior performance; it ranks 4th in the 2026 NTIRE X-AIGC Quality Assessment Track 2 without any additional training data.
Problem

Research questions and friction points this paper is trying to address.

Image Editing Quality Assessment
Metric Prompting
Score Continuity
Distance-Aware Modeling
Multimodal Reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Feedback-Driven Metric Prompt Optimization
Token-Decoupled Distance Regression Loss
Image Editing Quality Assessment
Score Continuity Modeling
Multimodal Reasoning
πŸ”Ž Similar Papers
No similar papers found.