VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results

📅 2025-09-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large multimodal models (LMMs) exhibit limited capability in open-domain, multi-image visual quality comparison and fine-grained reasoning. Method: We introduce Multi-Quality-Bench—the first hierarchical visual quality assessment benchmark tailored for LMMs—comprising single-image, two-alternative forced-choice (2AFC), and multiple-choice (MCQ) tasks, with thousands of progressively refined evaluation samples. Our approach employs a human-perception-aligned, interpretable evaluation framework integrating instruction-tuned LMMs with joint binary preference and MCQ evaluation paradigms. Contribution/Results: We launched an international challenge attracting nearly 100 participating teams; five top-performing models demonstrated the efficacy of instruction tuning for visual quality assessment. Multi-Quality-Bench establishes a standardized, reproducible foundation for rigorous, large-scale evaluation and advances systematic research in LMM-based visual quality understanding.

Technology Category

Application Category

📝 Abstract
This paper presents a summary of the VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models (LMMs), hosted as part of the ICCV 2025 Workshop on Visual Quality Assessment. The challenge aims to evaluate and enhance the ability of state-of-the-art LMMs to perform open-ended and detailed reasoning about visual quality differences across multiple images. To this end, the competition introduces a novel benchmark comprising thousands of coarse-to-fine grained visual quality comparison tasks, spanning single images, pairs, and multi-image groups. Each task requires models to provide accurate quality judgments. The competition emphasizes holistic evaluation protocols, including 2AFC-based binary preference and multi-choice questions (MCQs). Around 100 participants submitted entries, with five models demonstrating the emerging capabilities of instruction-tuned LMMs on quality assessment. This challenge marks a significant step toward open-domain visual quality reasoning and comparison and serves as a catalyst for future research on interpretable and human-aligned quality evaluation systems.
Problem

Research questions and friction points this paper is trying to address.

Evaluating LMMs' visual quality comparison abilities
Creating benchmark for coarse-to-fine quality tasks
Developing holistic evaluation protocols for quality assessment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Novel benchmark with coarse-to-fine tasks
Holistic evaluation using 2AFC and MCQs
Instruction-tuned LMMs for quality assessment
🔎 Similar Papers
No similar papers found.
H
Hanwei Zhu
challenge organizers
Haoning Wu
Haoning Wu
Shanghai Jiao Tong University
Computer VisionMulti-modal LearningGenerative Models
Z
Zicheng Zhang
challenge organizers
L
Lingyu Zhu
challenge organizers
Y
Yixuan Li
challenge organizers
Peilin Chen
Peilin Chen
University of Virginia
AI ChipsIn-Memory ComputingComputer Architecture
S
Shiqi Wang
challenge organizers
C
Chris Wei Zhou
challenge organizers
Linhan Cao
Linhan Cao
Shanghai Jiao Tong University
Image Quality Assessment Video Quality Assessment
W
Wei Sun
participants of the VQualA 2025 Challenge
X
Xiangyang Zhu
participants of the VQualA 2025 Challenge
W
Weixia Zhang
participants of the VQualA 2025 Challenge
Yucheng Zhu
Yucheng Zhu
Shanghai Jiaotong University
Multimedia Signal Processing
J
Jing Liu
participants of the VQualA 2025 Challenge
D
Dandan Zhu
participants of the VQualA 2025 Challenge
Guangtao Zhai
Guangtao Zhai
Professor, IEEE Fellow, Shanghai Jiao Tong University
Multimedia Signal ProcessingVisual Quality AssessmentQoEAI EvaluationDisplays
X
Xiongkuo Min
participants of the VQualA 2025 Challenge
Zhichao Zhang
Zhichao Zhang
School of Mathematics and Statistics, NUIST
Graph Signal ProcessingGraph Neural NetworkImage Processing
X
Xinyue Li
participants of the VQualA 2025 Challenge
S
Shubo Xu
participants of the VQualA 2025 Challenge
Anh Dao
Anh Dao
Undergraduate Student, Michigan State University
Vision-languageMultimodal LLMEmbodied AILLM
Y
Yifan Li
participants of the VQualA 2025 Challenge
H
Hongyuan Yu
participants of the VQualA 2025 Challenge
J
Jiaojiao Yi
participants of the VQualA 2025 Challenge
Y
Yiding Tian
participants of the VQualA 2025 Challenge